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PREFACE 


In the fall of 2000, I was assigned to teach history of mathematics on the 
retirement of the person who usually did it. And this with no more reason 
than the historical snippets that I had included in my previous book, Fourier 
Analysis and Boundary Value Problems. | was clearly fond of history. 

Initially, | was unhappy with this assignment because there were two ob- 
vious difficultie from the start: (7) how to condense about 6000 years of 
mathematical activity into a three-month semester? and (ii) how to quickly 
learn all the mathematics created during those 6000 years? These seemed 
clearly impossible tasks, until I remembered that Joseph LaSalle (chairman 
of the Division of Applied Mathematics at Brown during my last years as a 
doctoral student there) once said that the object of a course is not to cover 
the material but to uncover part of it. Then the solution to both problems was 
clear to me: select a few topics in the history of mathematics and uncover them 
sufficient! to make them meaningful and interesting. In the end, I loved this 
job and I am sorry it has come to an end. 

The selection of the topics was based on three criteria. First, there are 
always students in this course who are or are going to be high-school teachers, 
so my selection should be useful and interesting to them. Through the years, 
my original selection has varied, but eventually I applied a second criterion: 
that there should be a connection, a thread running through the various topics 
through the semester, one thing leading to another, as it were. This would give 
the course a cohesiveness that to me was aesthetically necessary. Finally, there 
is such a thing as personal taste, and I have felt free to let my own interests 
help in the selection. 

This approach solved problem (i) and minimized problem (ii), but I still had 
to learn what happened in the past. This brought to the surface another large 
set of problems. The firs time I taught the course, I started with secondary 
sources, either full histories of mathematics or histories of specifi topics. 
This proved to be largely unsatisfactory. For one thing, coverage was not 
extensive enough so that I could really learn the history of my chosen topics. 
There is also the fact that, frequently, historian A follows historian B, who 
in turn follows historian C, and so on. For example, I have at least four 
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books in my collection that attribute the ratio test for the convergence of 
infinit series to Edward Waring, but without a reference. I finall traced this 
partial misinformation back to Moritz Cantor’s Vorlesungen iiber Geschichte 
der Mathematik.’ This is history by hearsay, and I could not fully put my 
trust in it. There is also the matter of unclear or insufficien references, with 
the additional problem that sometimes they are to other secondary sources. 
Finally, I had to admit that not all secondary sources offer the truth, the whole 
truth, and nothing but the truth (Rafael Bombelli, the discoverer of complex 
numbers, has particularly suffered in this respect). The long and the short of 
it is this: it’s a jumble out there. 

After my firs semester teaching the course, it was obvious to me that I 
had to learn the essential facts about the work of any major mathematician 
included here straight from the horse’s mouth. [had to fin original sources, or 
translations, or reprints. I enlisted the help of our own library and the Boston 
Library Consortium, with special thanks to MIT’s Hayden Library. Beyond 
this, I relied on the excellent service of our Interlibrary Loan Department. But 
even all this would have been insufficien and this book could not have been 
written in its present form. I purchased a large collection of books, mostly out 
of print and mostly on line, and scans of old books on CD, all of which were of 
invaluable help. Special thanks are also due to the Gottfried Wilhelm Leibniz 
Bibliothek, of Hanover, for copies of the relevant manuscripts of Leibniz on 
his discovery of the calculus. As for the rest, the very large rest, I went on line 
to several digitized book collections from around the world, too many to cite 
individually. It is a wonder to me that history of mathematics could be done 
before the existence of these valuable resources. 

Many of the works I have consulted are already translated into English 
(such as those by Ptolemy, Aryabhata, Regiomontanus, some of Viéte’s, 
Napier, Briggs, Newton, and—to a limited and unreliable extent—Leibniz), 
but in most other cases the documents are available only in the language orig- 
inally written in or in translations into languages other than English (such as 
those by Al Tusi, Saint Vincent, Bombelli, most of Gregory’s, Fermat, Fourier, 
da Cunha, and Cauchy). Except by error of omission, the translations in this 
book that are not credited to a specifi source are my own, but I wish to thank 
my colleague Rida Mirie for his kind help with Arabic spelling and translation. 


Vol. 4, B. G. Teubner, Leipzig, 1908, p. 275. Cantor gave the reference, but with no 
page number, and then he put his own misleading interpretation of Waring’s statement in 
quotation marks! For a more detailed explanation of Waring’s test, stronger than the one 
given later by Cauchy, see note 21 in Chapter 6. 
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As much as I believe that a text on mathematics must include as many 
proofs as possible at the selected level, for mathematics without proofs is just 
a story, I also believe that history without complete and accurate references 
is just a story, a frustrating one for many readers. I have endeavored to give 
as complete a set of references as I have been able to. Not only to original 
sources but also to facsimiles, translations into several languages, and reprints, 
to facilitate the work of the reader who wishes to do additional reading. These 
details can be found in the bibliography at the end of the book. For easy and 
immediate access, references are also given in footnotes at each appropriate 
place, but only by the author’s last name, the work title, volume number if 
applicable, year of publication the firs time that a work is cited in a chapter, 
and relevant page or pages. 

I can only hope that readers enjoy this book as much as I have enjoyed 
writing it. 


Dunstable, Massachusetts Enrique A. Gonzdlez-Velasco 
June 15, 2010 


TRIGONOMETRY 


1.1 THE HELLENIC PERIOD 


Trigonometry as we know it and as we call it today is a product of the seven- 
teenth century, but it has very deep roots. In antiquity, it all started with a stick 
sunk in the soil perpendicular to the ground, and with the measurement of its 
shadow in the sun. From repeated observations of the shadow, such things as 
the length of the day or the length of the year could be determined, as well as 
the location in the year of the solstices and equinoxes. Because the Greeks 
would obtain knowledge from the length of the stick’s shadow, they called the 
stick a gnomon (yv@ucv), a Greek word that has the same root as “to know.” 

At present there is a wider interest in plane trigonometry, and it is the 
history of this discipline that we shall outline, but it should not come as a 
surprise that some of the earliest users were primarily concerned with spher- 
ical trigonometry. When humans could fin some time for tasks other than 
procuring food and shelter, when scientifi curiosity was possible and started 
to develop, people looked at the heavens and tried to figur out the mystery 
of the sun, moon, and stars circling around. Thus, astronomy was one of 
the earliest sciences and, together with geography, made the development of 
trigonometry, plane and spherical, a necessity. 

Four astronomers were the main known contributors to the development 
of the firs phase of this subject: Aristarchos of Samos, Hipparchos of Nicza, 
Menelaos of Alexandria, and Klaudios Ptolemaios.' 


1 Except in quotations from other sources, I write Greek names with Greek rather than 
Latinized endings, as in Aristarchos rather than Aristarchus. 


E.A. Gonzalez-Velasco, Journey through Mathematics: Creative Episodes in Its History, 1 
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Aristarchos (310—230, BCE) was basically a mathematician, and he was 
known as such in his own time. Aétius refers to him as “Aristarchus of Samos, 
a mathematician and pupil of Strato ...”* in his Doxographi greci. 

Aristarchos’ main contribution to astronomy is the proposal of the helio- 
centric system of the universe, but this work has been lost. We know about it 


2 Quoted from Thomas, Selections illustrating the history of Greek mathematics, Il, 1942, 
p. 3. Strato of Lampascus was head of the Alexandrian Lyceum from 288/287 to 270/269 
BCE [p. 2]. This was a school modeled after Aristotle’s Peripatetic Lyceum, so called 
because Aristotle taught while walking in the garden of the hero Lycos. 
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from a reference in The sand-reckoner of Archimedes [pp. 221—222]:° 


Now you are aware [“‘you” being Gelon, tyrant of Syracuse] that ‘universe’ is 
the name given by most astronomers to the sphere, the centre of which is the 
centre of the earth, while its radius is equal to the straight line between the 
centre of the sun and the centre of the earth. This is the common account (ta 
ypagpoueva), as you have heard from astronomers. But Aristarchus of Samos 
brought out a book consisting of some hypotheses, in which the premises 
lead to the result that the universe is many times greater than that now so 
called. His hypotheses are that the fi ed stars and the sun remain unmoved, 
that the earth revolves around the sun in the circumference of a circle, the 
sun lying in the middle of the orbit, and that the sphere of the fi ed stars, 
situated about the same centre as the sun, is so great that the circle in which 
he supposes the earth to revolve bears such a proportion to the distance of 
the fi ed stars as the centre of the sphere bears to its surface. 


As much as the loss of this book and the fact that his contemporaries did not 
accept this theory are to be lamented, the heliocentric hypothesis is not central 
to our quest. 

Aristarchos’ only extant book, On the sizes and distances of the sun and 
moon, is more interesting for our purposes. This is a book on mathematical 
astronomy, with little or nothing on the practical side. After stating six basic 
hypotheses, starting with* 


a That the moon receives its light from the sun. 
and ending with 
¢ That the moon subtends one fifteent part of a sign of the zodiac. 


Aristarchos then gave a set of 18 propositions with proofs. There being twelve 
signs of the zodiac in a full circle of 360°, one sign of the zodiac is 30°, 
which would make the moon subtend an arc of 2°. Now, this is incorrect, 
and Aristarchos himself knew better (although not, apparently, at the time of 
writing the preserved manuscript) because Archimedes said that Aristarchos 
“discovered that the sun appeared to be about 1/720th part of the circle of 


3 This work can be seen in Heath, The works of Archimedes with The method of 
Archimedes, 1912, pp 221—232. Page references given in brackets are to the Dover Publi- 
cations edition, 1953. 

4 The stated hypotheses are in Heath, Aristarchus of Samos, 1913, p. 302. Page references 
given below for additional quotations are to this edition. 
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A sixteenth-century manuscript copy of On the sizes and distances of the sun and moon. 
The header reads: Apiotdpyov nepi weyeO@v Kai dnootnudtav Hiiov Kai ceAjvyc. 
Reproduced from the virtual exhibition El legado de las matemdticas: 
de Euclides a Newton, los genios a través de sus libros, Sevilla, 2000. 


the zodiac” [The sand-reckoner, p. 353]; that is, one half of a degree, and 
Aristarchos believed that the sun and the moon subtend the same arc (as he 
stated in Proposition 8 [p. 383]). 


His conclusions are erroneous because his sixth hypothesis is, but the 
proofs are mathematically correct. Most are based on what we now call 
Euclidean geometry, but Aristarchos also used what we know as trigonometry. 
For instance, consider [p. 365] 


PROPOSITION 4. 


The circle which divides the dark and bright portions in the moon is not 
perceptibly different from a great circle in the moon. 
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This situation is represented in the next figur (which excludes some of 
the lines and letters of the original). The circle represents the moon with its 
center at B, the point A is the eye of an observer on earth, and the sun, eclipsed 


ZA 


ET 


by the moon, is on the left (not shown). The bright portion of the moon is 
represented by the arc AZET and the dark side by the are AHI. The circles 
mentioned in the proposition are shown as the segments AI and ZE. The 
point © is chosen so that the arc OH is equal to half the arc AZ. 

The proof consists in showing that this last arc is negligible by estimating 
its maximum possible size if the angle AAJ is 2°, and it is mostly geometric; 
but then Aristarchos used the following fact [p. 369]: 


And B@ has to @A a ratio greater than that which the angle BA@ has to the 
angle ABO. 


To interpret this statement, let 7 denote the foot of the perpendicular from © 
to AB (not shown in the figure and denote the angles BA® and ABO by a 
and £, respectively. Then, in our usual terminology, 


orl ; OIT 
and sin B = 


sina = . 
OA Bo 


This allows us to compute the ratio BO/@A and to restate Aristarchos’ state- 
ment in the equivalent form 


sina @ 


: Soi 
sin B B 


which shows its true nature as a trigonometric statement. 
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The remarkable thing is that Aristarchos, who was extremely clear and 
detailed in the geometric part of the proof, offered no comment or explanation 
about the preceding statement. 

Another instance of the use of trigonometry occurs in the proof of 


PROPOSITION 7. 


The distance of the sun from the earth is greater than eighteen times, but less 
than twenty times, the distance of the moon from the earth. 


Actually, the sun is about four hundred times farther away than the moon. 
But this is neither here nor there, for had Aristarchos started with correct values 
from accurate measurements, he would have arrived at correct results. The 
relevance of his work is in showing that the power of mathematics can lead to 
conclusions of the type reached. 

In the course of the proof, which is based on a figur some of whose 


H 


B E 


components are reproduced here, Aristarchos asserted the following [p. 377]: 


Now, since HE has to E@ aratio greater than that which the angle HBE has 
to the angle OBE,... 


It is very easy to translate this statement into our present language: if a and 6 
are angles between 0 and 7/2, and ifa > £, then 


and again this is trigonometry. Once more, Aristarchos offered no comment 
or proof. He simply assumed this as a fact, which suggests that these trigono- 
metric assumptions may have been well-known facts at the time. At any rate, 
they were known to him. 
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Very little is known about Hipparchos (190-120, BCE). The main known 
facts are these: he was born in Nicza, in Bithynia (now called Iznik in north- 
western Turkey, about 100 kilometers by straight line southeast from Istanbul); 
he was famous enough to have his likeness stamped on coins issued by several 
Roman emperors; and he devoted his scientifi life to astronomy, making 
observations at Alexandria in 146 BCE and at Rhodes, where he probably 
died, about twenty years later. All but one of his works have been lost, and 
we have no direct knowledge of his work on trigonometry, even though he 
is usually regarded to be its founder. We know about his research indirectly, 
from other sources. 

One of Hipparchos’ claims to fame is a determination of the solar year that 
is closer to the real one than the one in use for more than 1600 years after his 
finding Comparing the length of the gnomon’s shadow at the summer solstice 
of 135 BCE with that measured by Aristarchos at the summer solstice of 280 
BCE, led him to correct the then accepted figur of 365 ; days in a now lost 
tract On the length of the year. Ptolemaios provided the following evidence 
of this fact in his Mathematike syntaxis:° 


And when he more or less sums up his opinions in his list of his own writings, 
he [Hipparchos] says: ‘I have also composed a work on the length of the year 
in one book, in which I show that the solar year (by which I mean the time 
in which the sun goes from a solstice back to the same solstice, or from an 
equinox back to the same equinox) contains 365 days, plus a fraction which 
is less than i by about aath of the sum of one day and one night, and not, 
as the mathematicians suppose, exactly i-day beyond the above-mentioned 


number [365] of days.’ 


This would put the length of the year at 365 days, 5 hours, 55 minutes, and 
12 seconds. This length is very close to the true value of 365 days, 5 hours, 
48 minutes, and 46 seconds, and represents an opportunity missed by Julius 
Cezesar when he reformed the calendar in 46 BCE and ignored Hipparchos’ 
value (he was counseled by the professional astronomer Sosigenes). 

Trigonometry is supposed to have had its start in a table of chords sub- 
tended by arcs of a circle compiled by Hipparchos. We have this on the 
authority of Theon of Alexandria, who wrote the following in his commentary 
on Ptolemaios’ Syntaxis:® “An investigation of the chords in a circle is made 
by Hipparchus in twelve books and again by Menelaus in six.” 


5 Quoted from Toomer, Ptolemy’s Almagest, 1984, p. 139. 
§ Quoted from Thomas, Selections illustrating the history of Greek mathematics, Il, 
p. 407. 
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While the exaggerated number of twelve books must be in error, the tables 
were real.’ They were based on dividing the circle into 360 parts with each 
part divided into 60 smaller parts, as the Babylonians had already done. It 
appears that Hipparchos evaluated the lengths of chords of angles at 75 parts 
apart and then interpolated values at intermediate points. But how a table 
of chords is constructed will be better explained when we cover the work of 
Ptolemaios, who incorporated that of Hipparchos and provided all necessary 
details. 

There is little to say about Menelaos of Alexandria (c. 70—c. 130) regarding 
plane trigonometry: writing in Rome, he compiled a table of chords in six 
books, but it has not survived. He wrote a number of texts, but the only one 
that has been preserved to our times is an Arabic translation of his Spherica in 
three books. In it, he made noteworthy contributions to spherical trigonometry, 
which we shall not discuss. 

But the fina synthesis of the trigonometric knowledge of antiquity took 
place in Alexandria, the magnificen city founded in Egypt by Alexander III, 
son of Philip II of Macedonia, who had set out at the age of 20, in 336 BCE, 
to conquer the world. He succeeded, indeed, in becoming ruler of most of the 
known world of his time, totally subduing the Persians. The city of Alexandria 
was destined to become the center of Hellenic culture for centuries and the 
undisputed hub of mathematics in the world. On the death of Alexander in 323 
BCE, at the age of 33, Egypt was governed by Ptolemy Soter (Savior), one 
of Alexander’s generals, who was also his close friend and perhaps a relative, 
and who became king of Egypt in 305 BCE. Ptolemy I and his successors were 
enlightened rulers, bringing scholars from all over the world to Alexandria. 
Here, at the Museum, or temple of the Muses, they had a magnificen library 
at their disposal, botanical and zoological gardens, free room and board in 
luxurious conditions, exemption from taxes, additional salaries, and plenty of 
free time to engage in their own scholarly pursuits. Their only duty was to give 
regular lectures, a situation equivalent to that of a very generously endowed 
modern university plus quite a few additional perks. 

Ptolemy I founded a city that was called Ptolemais, named after him, as 
a center of Hellenic culture in upper Egypt, and it was here that Klaudios 
Ptolemaios (c. 87-150) is said to have been born. He is usually known by the 
name Claudius Ptolemy, and, since this is almost as much ofa household name 
as Euclid’s, we shall use it henceforth. The Claudius part of his name suggests 


7 These and Hipparchos’ methods have been reconstructed by Toomer in “The chord table 
of Hipparchus and the early history of Greek trigonometry” (1973/74) 6-28. 
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that he was a Roman citizen, but at some time he moved to Alexandria, where 
he studied under Theon of Smyrna, and probably remained at the Museum for 
the rest of his life. 

Ptolemy wrote books on many subjects, such as optics, harmonics, and 
geography, but his fame rests on his great work Mathematike syntaxis (Mathe- 
matical collection) in thirteen books, probably written during the last decade of 
the reign of Titus lius Antoninus (138—161), also known as Antoninus Pius, 
one of the fi e good Roman emperors. This work was also called Megale 
syntaxis (Large collection) to distinguish it from smaller or less-important 
ones. Later, writers in the Arabic language combined their article al with 
the superlative form megiste of megale to make al-mjsty. This is why the 
renamed E wéyioty obvtaéic is usually known as the Almagest. This book 


The firs printed edition of the Almagest, Venice, 1515. 
Reproduced from the virtual exhibition El legado de las matematicas: 
de Euclides a Newton, los genios a través de sus libros, Sevilla. 


did for astronomy and trigonometry what Euclid’s Elements did for geometry; 
for more than a thousand years it remained the source of most knowledge in 
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those fields and there were numerous manuscript copies and translations into 
Syriac, Arabic, and Latin. 

Ptolemy described his intention in writing this compilation at the outset, 
at the end of Article 1:8 


For the sake of completeness in our treatment we shall set out everything 
useful for the theory of the heavens in the proper order, but to avoid undue 
length we shall merely recount what has been adequately established by the 
ancients. However, those topics which have not been dealt with [by our 
predecessors] at all, or not as usefully as they might have been, will be 
discussed at length, to the best of our ability. 


Regarding trigonometry and the table of chords, which are in Book I of 
the Almagest, Ptolemy compiled all the knowledge on this subject up to his 
time, but did not tell us which parts were due to Hipparchos or to Menelaos. 
We shall assume that their results are included in Ptolemy’s presentation. 


1.22 PTOLEMY’S TABLE OF CHORDS 


Tables of chords are important for the same reason that today’s trigonometric 
functions are: they allow us to solve triangles, and this is necessary in applica- 
tions to astronomy and geography. For this reason and because the Almagest 
represented the definit ve state of western trigonometry for the next millen- 
nium, the construction of this table of chords will be presented in sufficien 
detail. 

Ptolemy began [Art. 10]9 by dividing the perimeter of a circle into 360 
parts, and then the diameter into 120 parts. Although Ptolemy used the same 
word here for parts (tu#uwata), these two parts are different. Otherwise, the 
ratio of the perimeter to the diameter would be 3, but Ptolemy determined that 
this ratio is approximately 3 i. We shall call the 360 parts of the perimeter 
“degrees” (from the Latin de gradus = “[one] step away from”), and use the 
modern notation 360°.'° We shall also use “minute” and “second” (derived 


8 Quoted from Toomer, Ptolemy’s Almagest, p. 37. 

° The more verbal translation by Thomas in Selections illustrating the history of Greek 
mathematics, II, pp. 412—443, is closer than Toomer’s to the Greek original style, and, while 
this may be too tiring for the long haul, it may be quite adequate for a shorter presentation. 
For this reason I stay closer to this translation here. Quotations are from this edition and 
page references are to it. 

10 Tt may not be modern at all. For the origin of the present symbols for degrees, minutes, 
and seconds, see Cajori, A history of mathematical notations, 1, 1929, pp. 142-147. 
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from the Latin pars minuta prima, or firs small part, and pars minuta secunda, 
or second small part) for the usual concepts and with the usual notation of one 
or two strokes. We need a different notation for the 120 parts of the diameter, 
and adopt the widely used superscript ” for this purpose. Finally, before 
embarking on his construction of a table of chords of central angles in a circle, 
Ptolemy said that he will [p. 415] 


use the sexagesimal system for the numerical calculations owing to the 
inconvenience of having fractional parts, especially in multiplications and 
divisions." 


With this notational hurdle out of the way, Ptolemy proceeded to state a 
few necessary theorems to compute the chords of angles in steps of half a 
degree. To interpret his results in current notation, refer to the next figur and 


A 


B 


note that if @ is an acute angle and if we denote the length of the chord AB of 
the angle 26 by crd 20, then we have the basic relation 


: ie AB __ erd 26 
sm OA 220A diameter 120 — 


We start with Ptolemy’s firs theorem [p. 415]: 


First, let ABI be a semicircle on the diameter AAT and with centre A, and 
from A let AB be drawn perpendicular to AI’, and let AT be bisected at E, 


a Ptolemy was right in this choice: the number 60 is divisible by 2, 3, 4, 5, 6, 10, 12, 15, 
20 and 30, and of the fractions with denominators 2 to 9 only 60/7 gives a nonterminating 
quotient. By contrast, 10—our likely choice for a system base—is divisible by 2 and 5, and 
has nonterminating quotients when divided by 3, 6, 7 and 9. 
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i, 


and let E B be joined, and let E Z be placed equal to it, and let ZB be joined. I 
say that Z A is the side of a [regular] decagon, and BZ of a [regular] pentagon. 


We omit the proof. Ptolemy’s is somewhat involved, since it depends on 
results from Euclid’s Elements that, besides depending on previous proposi- 
tions by Euclid, are unlikely to be available in the mind of the average reader. 
Itis possible to give a short modern proof, which might be jarring and anachro- 
nistic at this point, so the best compromise is to give a reference. ' 

This lack of proof notwithstanding, if the preceding theorem is accepted, 
then it can be used to start the evaluation of some chords. Thus, Ptolemy 
continued as follows [pp. 417—419]: 


Then since, as I said, we made the diameter consist of 120”, by what has been 
stated AE, being half of the radius, consists of 30” and its square of 900”, 
and BA, being the radius, consists of 60” and its square of 3600”, while the 
square on E B, that is, the square on EZ, consists of 4500”; therefore EZ is 
approximately 67? 4 55, and the remainder AZ is 37? 455. 


Hold your horses, some reader might be thinking. How did they find in 
antiquity, that 4500 = 67? 4 55, and what does this notation mean? The 
easiest thing to explain is the notation: 67” 4 55 means 


67 + : + > 
60 3600 


12 Ptolemy’s proof can, of course, be seen in any of the translations of the Almagest already 
mentioned. The best recent presentation that I have seen of all the results in Article 10, with 
very brief and clear modern proofs, is in Glenn Elert, Ptolemy’s table of chords. Trigonometry 
in the second century, www.hypertextbook.com/eworld/chords/shtml, June 1994. 


SECTION 1.2 PTOLEMY’S TABLE OF CHORDS 13 


in the sexagesimal system that Ptolemy said he was going to use. As for the 
extraction of the square root, he did not offer any explanation whatever. Is 
this is a trivial matter that we should figur out by ourselves? Apparently not, 
because Theon of Alexandria, writing his Commentary on Ptolemy’s Syntaxis 
more than 200 years after its writing, spent some time and effort showing 
how to evaluate this root.'? However, an electronic calculator gives 4500 = 
67.08203933, from which value we get the 67”. Now, 


(0.08203933)(60) _ 4.9223598 
60 ~ 60 


from which we get the 4 pars minuta prima, so to speak. And then 


(0.9223598)(60) _ 55.341588 
60 ~ "60 


which, rounding down, provides the 55 pars minuta secunda. 

Having obtained EZ = 67?4 55 (we are writing equalities for conve- 
nience, but it should be understood that many of these computations provide 
only approximate values), Ptolemy then concluded that the side of the regular 
decagon is 


0.08203933 = 


0.9223598 = 


AZ = EZ — AE = 67° 455 — 30? = 37? 4 55 
and, since this side subtends an arc of 36°, then, using our notation, 
crd 36° = 37?4 55. 


As for the side BZ of the regular pentagon, which subtends an arc of 72°, 
Ptolemy made the following computation (but only in narrative form) [p. 419]: 


BZ =VBA? + AZ? = J602 + (37° 455)2 = V4975? 4 15, 


and then 
crd 72° = 70? 455. 


We have computed only two chords, but a few more are easy. Since the side 
of the regular hexagon is equal to the radius, we have 


crd 60° = 60”; 


13 His evaluation can be seen in Rome, Commentaires de Pappus et de Théon d’Alexandrie 
sur l’Almageste, Il, 1936, pp. 469—473. Also in Thomas, Selections illustrating the history 
of Greek mathematics, I, 1941, pp. 56—61. 
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the side of the regular square is the square root of twice the square of the 
radius, so that 


crd 90° = 60? + 607 = ¥7200 = 84” 51 10; 


and the side of the regular triangle is 3 times the radius (using well-known 
Euclidean geometry the reader can deduce this result in a short time), which 
gives 


erd 120° = 6073 = 10800 = 103” 55 23. 


Now consider two chords subtending a semicircle in the manner of those 
shown in the next figur (not included in the Almagest). Ptolemy observed 


B 


A r 
A 


that [p. 421] “the sums of the squares on those chords is equal to the square 
on the diameter” (since the angle ABI" = 90°); that is, 


AR’ tar? jar?e 


Then he gave the following example. If BI’ is an arc of 36°, then 


erd 144° = AB = JAP? — Br? = 1202 — (3774 55)? = 114”7 37. 


Similarly, 
crd 108° = ¥ 120? — crd*72° = 97° 4 56. 


At this moment we sum up by stating that we have computed the chords of 
36°, 60°, 72°, 90°, 108°, 120°, and 144°. Not a small harvest, but very far 
from the promised table. 

To continue the computation of chords, Ptolemy set out “by way of preface 
this little lemma (Ayjupdtiov) which is exceedingly useful for the business at 


14 Tf we denote the angle BAA in the previous figur_by 20, this equation can be rewritten 
as crd?20 + erd2 (180° — 20) = 1202. Then, if we recall that crd 20 = 120 sin @, it becomes 
120? sin? 6 + 1207 sin?(90° — 6) = 120° or sin? 6 + cos* 9 = 1. 
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hand.” Since there is no record that this lemma was known before Ptolemy, it 
is usually known as Ptolemy’s Theorem [p. 423]. 


Let ABTA be any quadrilateral inscribed in a circle, and let AT and BA be 
joined. It is required to prove that the rectangle contained by AI and BA is 
equal to the sum of the rectangles contained by AB, AI and AA, BT. 


B 0 


Of course, we would prefer to write the theorem’s conclusion as follows: 
AI’: BA=AB.AI+AA: BT, 


but mathematics was expressed verbally in the ancient world and for a long 
time after that. The development of our usual mathematical notation was a 
very slow process that matured only in the middle of the seventeenth century, 
in the work of Descartes and Newton. Be that as it may, starting with the 
next equation, we shall frequently use the signs =, +, and —, and replace “the 
rectangle contained by ...” with a dot to denote a product. 

Here is Ptolemy’s own proof of the stated theorem. It is based on well- 
known Euclidean geometry. 


For let the angle ABE be placed equal to the angle ABI" [that is, choose E 
so that this is so]. Then, if we add the angle E BA to both, the angle ABA 
equals the angle EBT. But the angle BAA equals the angle BIE, for they 
subtend the same segment [the chord AB]; therefore the triangle ABA is 
isogonal (tg0y@viov) with the triangle BE. Therefore, the ratio BI" over 
IE equals the ratio BA over AA. Therefore 


Br-AA=BA.-TE. 


Again, since the angle AB E is equal to the angle ABT’, while the angle BAE 
is equal to the angle BAT, therefore the triangle ABE is isogonal with the 
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triangle BI-A. Analogously, the ratio BA over AE equals the ratio BA over 
AT. Therefore, 
BA- AI =BA-AE. 


But it was shown that 
Br. AA=BA-TE, 


and therefore as a whole [meaning BA. A'+ BI’. AA = BA-. AE+ 
BA-TE=(AE+TE)BA=AT’- BA| 


AI’-BA=AB.AI+AA.- BI, 
which was to be proved. 
No computation of chords was carried out directly using this theorem; this 


was done from its corollaries. Although not labeled or numbered by Ptolemy, 
we Shall do so for easy reference [p. 425]. 


COROLLARY I. 


This having first been proved, let ABTA be a semicircle having AA for its 
diameter, and from A let the two [chords] AB, AI be drawn, and let each of 
them be given in length, in terms of the 120” in the diameter, and let BT be 
Joined. I say that this also is given [here “given” means “found”. 


r 


Ptolemy’s sketch of proof, for it is no more than that, is very short. 


For let BA, I’A be joined; then clearly these also are given because they are 
the chords subtending the remainder of the semicircle. Then since ABI’A is 
a quadrilateral in a circle, 


AB-TA+AA-BIr=AI- BA 
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[by Ptolemy’s theorem]. And AJ”. BA is given, and also AB - ’A; therefore 
the remaining term AA- BT isalso given. And AA is the diameter; therefore 
the straight line BI” is given. 


But this corollary gives no formula to fin BI. We now fil in the missing 
steps. From the equation stated in this sketch of proof we obtain 


_ AP-BA-—AB-TA AI V AA? — AB? — ABV AA? — AF? 
_ AA 7 AA 


Br’ 


’ 


and BI can be found from the given chords and the diameter. '® 
Now, returning to the computation of chords, Ptolemy wrote that 


by this theorem we can enter [calculate] many other chords subtending the 
difference between given chords, and in particular we may obtain the chord 
subtending 12°, since we have that subtending 60° and that subtending 72°. 


In fact, using the chord formula just developed and the values already obtained 
for crd 72° and crd 60°, we would arrive at 


crd 12° = crd (72° — 60°) = 12? 32 36. 


Itis possible to obtain more than this. By successive application of the formula 
we can calculate crd 18° = crd (108° —90°) and then crd 6° = crd (18° —12°), 
but Ptolemy did not do this, and he had a good reason for it, to be disclosed 
later. In any event, our table of chords is beginning to fles out, but is not yet 


15 The previous equation can be expressed in terms of the trigonometric function crd if 
we denote the arcs AB and AI by 2a and 20, respectively. Then it becomes 


2 2 2y — 2 2 
edd — 20) = crd 26 1/120 erd*2a ore vV 120 erd*20 


12 
/ d72 206 
= crd20,/1— = — crd2a ¢ 2 
1202 1202 


Dividing both sides by 120, using the identity crd 26 = 120 sin 6, and noticing that 0° < a, 
@ < 90°, this equation can be rewritten as 


sin(@ —a) = sind V1 — sin? a — sina v1l—- sin? @ = sin@ cosa — sina cos@. 


We see that this familiar trigonometric formula was implicitly contained in Ptolemy’s work. 
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near the promise of findin the chords of all arcs in half degree intervals. So, 
more trigonometry must be developed [p. 429]. 


COROLLARY 2. 


Again, given any chord in a circle, let it be required to find the chord subtending 
half the arc subtended by the given chord. Let ABT be a semicircle upon the 


B 


A EZI 


diameter AI and let the chord IB be given, and let the arc IB be bisected 
at A, and let AB, AA, BA, and AT be joined, and from A let AZ be drawn 
perpendicular to AI. I say that ZI is half of the difference between AB 
and AI. 


We can abbreviate Ptolemy’s proof as follows. Choose a point E on AI’ so 
that AE = AB. Inthe triangles ABA and AEA, AE = AB, AA isacommon 
side, and the angle BAA is equal to the angle EAA (because the arcs BA and 
AT are equal by hypothesis); and therefore BA = AE. Since BA = AT, we 
obtain A = AE, so that AET is isosceles, and then 


ZP = 5ED = 5(AD — AE) = 5(AT — AB). 


This finishe the proof of the last statement in the corollary, but we are not 
done with the required task, which is to fin the chord AI. To this end, note 
that the right triangles AAI and AZT are isogonal, since they have the angle 
at in common, and then 

AI TA 


TA Zz 


so that 
TA? = Ar -TZ. 


Ptolemy got to this point but did not provide an equation giving A. Lacking 
any type of notation for equations, this is not surprising, but we can do it easily. 
Replacing Z with its value found above, 


PA? = $A: (AP — AB) = $A [AP — VAP? = Br? J. 


SECTION 1.2 PTOLEMY’S TABLE OF CHORDS 19 


Finding the square root of both sides and recalling that A” = 120? yields 


ra = J}(120? - 120/120? — Br?).."6 


Ptolemy summed up the usefulness of this result as follows [p. 431]: 


And again by this theorem many other chords can be obtained as the halves 
of known chords, and in particular from the chord subtending 12° can be 
obtained the chord subtending 6° and that subtending 3° and that subtending 
15° and that subtending $°+4° (= 3°). Weshall find when we come to make 
the calculation, that the chord subtending | - is approximately 1” 34 15 (the 


diameter being 120”) and that subtending i is 0? 47 8. 


This is probably the reason why Ptolemy did not compute the chord of 6° from 
Corollary 1, namely that it can be done more simply from Corollary 2. His 
next result was [p. 431]: 


COROLLARY 3. 


Again, let ABTA be a circle about the diameter AA and with center Z, and 
from A let there be cut off in succession two given arcs AB, BI’, and let there 
be joined AB, BI’, which, being the chords subtending them, are also given. 
I say that, if we join AT, it will also be given. 


To prove this, construct the diameter BE and the remaining segments 
shown in the picture. By Ptolemy’s theorem, applied to the quadrilateral 
BIAE, 

BA-TE=BI-AE+ BE -TA. 


16 If we denote the are BI” by 26 (this makes 6 < 90°), in which case A = crdé@ and 
BI =crd 20, then 


2 


Dividing both sides by 120, using the equation crd 20 = 120 sin @ (but with @ in place of 20), 
and in view of the fact that 6 < 90°, the above equation takes the form of the well-known 
trigonometric identity 


4 1-1 -sin20 joa 
sin 50 = = ‘ 
2 2 


1202 — 120,/ 1202 — crd 226 
crd@ = : 
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Therefore, 
BA-TE-BI-AE 47 


BE 


Now, Bland AB are given, while BA and IE can be found from AB and BI, 
respectively, using the Pythagorean theorem. Thus, since AE = AB, TA can 
be determined from the preceding equation, and then 


AD = VAA2 — TA?. 


As for the computation of chords, which was Ptolemy’s task, this is the 
way he put it [p. 435]: 


TA 


It is clear that, by continually putting next to all known chords a chord 
subtending | 7 and calculating the chords joining them, we may compute in 
a simple manner all chords subtending multiples of 14°, and there will still 
be left only those within the 15° intervals—two in each case, since we are 
making the diagram in half degrees. 


17 If we denote the arcs AB and BI” by 20 and 2a, respectively, recall that BE = 120?, 
and observe that AE = AB, the equation for "A can be rewritten as 
erd (180° — 24) erd (180° — 2a) — erd 2a crd 20 
120 


Using the identity crd2@ = 120sin@ once more and then dividing by 120, this equation 
becomes 


crd (180° — 20 — 2a) = 


sin(90°— 6 — aw) = sin(90° — 9) sin(90° — a) — sina sind, 
or equivalently 
cos(0 + a) = cos@ cosa — sina sind, 


another modern, well-known trigonometric identity. 
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The next difficult , in order to complete the table in intervals of half a 
degree, is precisely the computation of the chord of half a degree. Ptolemy 
did this from the following “little lemma” (another Anuatiov) [pp. 435-437]. 


For let ABTA be a circle, and in it let there be drawn two unequal chords, of 


Lia, 


\ FE 


A 


which AB is the lesser and BI the greater. I say that 


IB arc BI" 
EE os . 
BA arc BA 


To the attentive reader this should be a case of déja vu. This is precisely 
the trigonometric identity that Aristarchos had used in the proof of his Propo- 
sition 4, and it is sometimes known as Aristarchos’ inequality. But while he 
had made us think that it is a well-known fact by the absence of proof, Ptolemy 
provided a proof. It depends on some propositions from Euclid’s Elements, 
which were well known to Ptolemy and that he mentioned without a reference. 
Readers who are less familiar with Euclidean geometry may wish to consider 
the following just a sketch of proof. 

Let BA be the bisector of the angle ABI, and let new points H, E, Z, 
and © be as shown, the firs three on an arc of circle with center A. It should 
be clear that 


area of triangle AEZ area of sector AEO 


< 
area of triangle AEA _—_area of sector AEH 


Since the two triangles in this inequality have the same height AZ and since 
the two sectors have the same radius, the inequality reduces to 

EZ LZAE 

— < —__., 

EA LEAA 
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where we have used the symbol / for angle. Adding 1 to both sides and 
simplifying '® gives 

ZA LZAA 

EA LEAA’ 
and then, multiplying both sides by 2 and noting that Z bisects AI because 
BA bisects the arc AAT, 


TA LTAA 
EA ~ LEAA 
Subtracting | from both sides and simplifying yields 


TE LTAE LIAB 
< = . 
EA LEAA LBAA 


Using now the well-known facts '® 


TE TB LIAB arc I"B 


— = __ an = 
EA BA LBAA arc BA 


proves the lemma. 


Having concluded the theoretical part of his task, Ptolemy had to complete 
the computations. First he needed the chord of 4°, and to obtain it he used the 
next figur [p.441] showing two chords with acommon endpoint.2° Assuming 
firs that AB subtends an angle of * and AI an angle of 1°, and using the 
inequality in the preceding lemma but with AT in place of 'B, 
arcAT 4 


AI’ < BA = 
arc BA 3 


18 This operation on a fraction was well known at least since Euclidean times, and only 
one word was used to describe it. Ptolemy used ovv@évti (synthenti), meaning “putting 
together.” When these works were later translated into Latin the word componendo was 
used in its place, and this word was still frequently used well into the seventeenth century. 
Similarly, the operation described below, in which | is subtracted rather than added, was 
called dieAdvt1 (dielonti) by Ptolemy, meaning “having [the fraction] divided,” and was 
translated as dirimendo. 

19 They were well known to Ptolemy, since these are Propositions 3 and 33 of Book VI 
of Euclid’s Elements [pp. 195 and 273 of the Dover Publications edition, 1956]. Readers 
with a knowledge of high-school geometry will have no trouble proving these facts using 
the following hints. For the first draw a line through A parallel to E B until it intersects the 
line containing the segment BI at a point that can be denoted by J, and show firs that AB 
and BI have the same length. For the second, recall that an angle inscribed in a circle, such 
as ‘AB, is one-half of the central angle subtended by the same chord. 

20 This new figur is unnecessary, for the previous one shows such chords, and it is 
unfortunate that Ptolemy decided to change the notation on us for the new figure 
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Since BA = crd i* was shown just before Corollary 3 to be 0” 47 8, it follows 
that four-thirds of this value is 1” 2 50, and then 


erd 1° = AT < 17250. 


Next, use the same chords but assume now that AB subtends an angle of 1° 
and AJ an angle of 1 x. As above, we have 

arcBA 2 

arc AD 3 


Since we have already found that A = crd 15° = 1/34 15 and two-thirds 
of this value is 1? 2 50, we have 


BA> AI" 


crd1° = BA > 1?250. 


Since crd 1° has been shown to be both smaller and larger than 1? 2 50, it must 
“have approximately this identical value 1” 2 50.”21 Using now Corollary 2 
(or, rather, the formula developed after it for the chord of half an arc), we fin 
that 

ord 5° = 0731 25. 


Ptolemy concluded as follows [p. 443]: 


The remaining intervals may be computed, as we said, by means of the chord 
subtending 1 x. In the case of the firs interval, for example, by adding > we 
obtain the chord subtending 2°, and from the difference between this and 3° 
we obtain the chord subtending 24, and so on for the remainder. 


21 Ptolemy’s values for crd 1 7 and erd . are only approximations. More precise calcu- 
lations would show that 
172492 <crd1° < 17250, 


validating Ptolemy’s assertion. 
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Then he or his paid calculators evaluated the entries in the table, which made 
up Article 11. We show in the next table the firs six lines as a sample of this 
arrangement. 


Arcs Chords Sixtieths 
5 | 0/31] 25)0]1/)]2] 50 
12 | 1] 2/50] 0)1] 2] 50 
15° | 1] 34) 15] 0/142] 50 
2° |2) 5/40|)0]1) 2/50 
25 |2)37] 4/0)142) 48 
3° | 3} 8] 28|0)1 42) 48 


This is Ptolemy’s partial description of the table [p. 443]: 


The firs section [one column] will contain the magnitude of the arcs in- 
creasing by half degrees, the second will contain the lengths of the chords 
subtending the arcs measured in parts of which the diameter contains 120, 
and the third will give the thirtieth part of the increase in the chords for each 
half degree, in order that for every sixtieth part of a degree we may have a 
mean of approximation differing imperceptibly from the true figur and so be 
able to readily calculate the lengths corresponding to the fractions between 
the half degrees. 


While the firs part of this statement is clear, the second may benefi from an 
example. There are thirty intervals of 1’ from 7 to 1°, and if the difference 
1?250 —0?31 25 = 0731 25 is divided by 30 we obtain 0” 1 2 50. Thus, 


we can take 
erd 7 1’ + 0731 254071250 = 0? 32 27 50. 


Within an interval as small as , this provides a good approximation. 

With this we conclude our description of only Articles 10 and 11 of Book I 
of the Almagest. The complete work is much larger and, although much of it is 
a compilation of previous knowledge, the last fi e books on planetary motion 
represent Ptolemy’s most original contribution. This part of the Almagest has 
been called a masterpiece and remained the standard in astronomy until the 
sixteenth century. 
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1.3 THE INDIAN CONTRIBUTION 


The Gupta empire of northern India was founded by Chandragupta I, who 
ruled from Pataliputra, or City of Flowers, the modern Patna on the Ganges 


Pataliputra 


* Ujjain 


River. Near Pataliputra, but the exact location is unknown, was the town 
of Kusumapura that would emerge as one of the two major early centers of 
mathematical knowledge in fifth and sixth-century India (the other one was 
at Ujjain). It is not surprising that Kusumapura became a center of learning, 
being near the capital of the empire and center of the trade routes. Knowledge 
from and to other parts of the world would naturally fl w through Pataliputra 
in that golden age of classical Sanskrit. 

An astronomer called Aryabhata may have worked in Kusumapura in the 
time of the emperor Budhagupta. Although the exact place of Aryabhata’s 
birth is not known (he is generally thought to be a southerner, possibly from 
as far as Kerala, on the southern coast of India), the date is. He himself said 
that he was 23 years of age when he finishe a work called the Aryabhatiya 
(Aryabhata’s work) in 499. Thus, he was born in 476, and he lived until 550. 
The Aryabhatiya is the only text of Aryabhata that has survived. It is a brief 
volume written in verse as a mnemonic aid, since some of its portions were 
meant to be memorized rather than read. It consists of only 123 stanzas (of 
which fi e may have been added later), and is divided into four sections, each 
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of them called pada: an introduction, a calculations section or ganitapada of 
33 verses—containing the trigonometry we want to examine—and the rest is 
on astronomy. 

Aryabhata was not the only Indian scholar whose work on trigonometry has 
survived. Several other works on astronomy, called Siddhantas or “established 
conclusions,” were written in what we may call medieval or pre-medieval 
India. Of these, only the anonymous Surya Siddhanta is completely extant. 
The versions of both books that have survived to the present are from the 
sixteenth century, and, while it is believed that the copy of the Aryabhatiya is 
identical or almost identical to the original, it is known that that of the Surya 
Siddhanta has undergone many alterations. 

Only the facts are presented in these texts, but this knowledge is assumed 
to be ancient and of divine origin (Surya means Sun, and this was supposed to 
be a book of revelation of the knowledge of the Sun), and no mention is made 
of either astronomical observations or mathematical deductions that may have 
led to it, nor is credit given to any preceding mathematicians. This makes it 
impossible to determine whether the Hindus were original or indebted to the 
Greeks or the Babylonians. 

If Aryabhata and other Siddhanta authors were aware of the Greek work 
on chords, they replaced this concept with that of the half-chord, as shown 
in the next figure and this has remained their most important and lasting 


A 


contribution. Actually, Aryabhata talked of the half-chord, ardha-jya, and of 
the chord half, using the name jya-ardha as was done in the Surya Siddhanta 
once, but later both documents simply used the word jya, also spelled jyva, 
for chord. 

To construct a table of jyvas—if we are permitted this plural—the Hindus 
needed to choose a unit of length. Departing from Hellenic use, they replaced 
the two Ptolemaic units—one for arcs and one for parts of the diameter—by a 
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single unit. They divided the length of the circumference into 21,600 parts and 
then used any one of these parts as their only unit of length. We can call these 
parts minutes, since there are 21,600’ minutes in a complete circumference of 
360°. 

How can the jyva be measured in this way? Very simply, and it is well 
to remember that we do something similar routinely today: firs we select the 
radius as the unit of length, and then we use this length to obtain the radian 
measure of an arc. To do this we use our knowledge of the value of x, which 
is define as the ratio of the length of the circumference, C, to the diameter. 
For us x = C/2, since the length of the diameter is two radians, and then 
C = 2z radians. For the Hindus, C = 21,600’, and if R denotes the length of 


the radius, 
in 21,600 _ 10,800 


20 TU 
In either case, one needs the value of z. 
The Surya Siddhanta gave the approximation x ~ \/10 in stanza 59 of 
Chapter I, but this is not the value used later to compute the jyvas. Aryabhata 
was a little more careful in the tenth stanza of the ganitapada:?? 


10. One hundred plus four, multiplied by eight, and added to sixty-two 
thousand: this is the nearly approximate measure of the circumference of a 
circle whose diameter is twenty thousand. 


He did not include an explanation, but the result of Aryabhata’s calculation is 
62,000 + 8(100 + 4) = 62,832, 


which divided by a diameter of 20,000 gives a quotient of 3.1416 as an ap- 
proximation of zr, the best approximation to his day. Thus, Aryabhata could 
fin the length of the radius to be 


10,800 
R= 
3.1416 


This is, then, the jyva of 90°.23 


3438. 


22 Shukla and Sarma, Aryabhatiya of Aryabhata, 1976, p. 45. The translations from the 
Aryabhatiya included here are from this source, but I have consistently replaced their word 
“Rsine” with jyva. 

2 If Aryabhata used his value of zr, he rounded up to obtain R, so his value is not precise. 
An electronic calculator will give the quotient 10,800/z as 3437.746771’ or 57° 17' 44.8”, 
which is closer to the true radian. 
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From this and a few elementary mathematical facts the Hindu mathemati- 
cians were able to construct a table of jyvas. They are given in the Surya 
Siddhanta in stanzas 15 to 22 of the second chapter, beginning as follows (an 
explanation will be given after each quotation): 4 


15. The eighth part of the minutes of a sign is called the firs half-chord 
(jyardha); that, increased by the remainder left after subtracting from it the 
quotient arising from dividing it by itself, is the second half-chord. 


Once more, since there are 12 signs of the zodiac ina full circle of 360°, a sign 
of the zodiac is equivalent to 30°, or 1800’, and the eighth part of this is 3 i’, 
or 225’. This is the value of firs half-chord, the jyva of 3 3° (the Hindus knew, 
as we do, that for small angular values the arc subtended by the angle and the 
half-chord are approximately equal). The quotient arising from dividing it by 
itself is 1, and then the remainder left after subtracting this quotient from the 
firs jyva is 224. Increasing the firs jyva by this amount, we obtain the second 
jyva: 225 + 224 = 449, 

Stanza 16 then gives the general rule to obtain an arbitrary jyva from the 
preceding ones. It is understood, although not so stated in the rule, that we 
are computing the jyvas of equally spaced angles in intervals of 3 2". 


16. Thus, dividing the tabular half-chords in succession by the first and 
adding to them, in each case, what is left after subtracting the quotients from 
the first the result is twenty-four tabular half-chords (jyardhapinda),?° in 
order, as follows: 


Thus, each jyva is obtained by adding to the preceding jyva what is left after 
subtracting from the firs jyva the quotients obtained by dividing all preceding 
Jyvas by the first 

To express the rule for obtaining jyvas in familiar mathematical terms 
rather than in narrative form, we shall coin a temporary trigonometric function, 
denoted by jya. Then, if we denote the angle 33° by @, the firs jyva is just 
jyaé@ and the nth, where n is a positive integer between | and 24, is jyand. 


24 Burgess, “Surya-Siddhanta. A text-book of Hindu astronomy,” 1860, p. 196. The trans- 
lations from the Surya-Siddhanta included here are from this source, but I have consistently 
replaced the word “sine” with “half-chord” (Burgess gave transliterations of the Sanskrit 
terms in parentheses). 

25 According to Burgess, p. 201, pinda means “the quantity corresponding to.” 
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Then the stated rule, to obtain each jyva by adding to the previous one “what 
is left after subtracting the quotients from the first” can be written as 


n—1 


F : : jya ko 
jyané = jya(n — 1) + jyao-— > i , 
& iyae 
n=1,..., 24. The inclusion of jya0° = 0 in the sum on the right, which was 


not explicitly done by the Hindu author, does not alter the sum. For instance, 
for n = 3, the right-hand side becomes 


225 449 
449 + 225 ~ 449 + 225 —1—-2= 671, 
225 225 vines _ 


which is jya 30. 

Using the jyva rule, the jyvas of the arcs in the firs quadrant in intervals of 
33° can be found to be (approximately) 225, 449, 671, ..., 3431, and 3438. 
These are the values stated in Sanskrit verse in stanzas 17 to 22 of the Surya 
Siddhanta, but it is easier for us to read them in table form.26 


Are Jiva Arc Jiva Arc Jiva 


33° | 225’ | 333° | 1910’ || 632° | 3084’ 


4 4 4 
7° | 449’ || 374° | 2093’ || 673° | 3177’ 
11d | 671" | 415° | 2267’ || 714° | 3256’ 


15° | 890’ | 45° | 2431’ || 75° | 3321’ 
183° | 1105’ || 482° | 2585’ || 782° | 3372’ 
225° | 1315’ | 525° | 2728’ || 825° | 3409’ 
26; | 1520’ || 565° | 2859’ |] 863° | 3431’ 
30° | 1719’ |} 60° | 2978" || 90° | 3438’ 


Aryabhata did not express the jyvas directly, but rather, in the twelfth stanza 


26 A reproduction of the table of half-chords, from a Sanskrit edition of the Surya Siddhanta 
published at Meerut, India, in 1867, can be seen in Smith, History of mathematics, I, 1925, 
p. 625. 
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of his introduction, or gitikapada,”’ to the Aryabhatiya he gave a sequence of 
differences between the values of the jyvas shown here [p. 29]: 


12. 225, 224, 222, 219, 215, 210, 205, 199, 191, 183, 174, 164, 154, 143, 
131, 119, 106, 93, 79, 65, 51, 37, 22, 7. These are the jyva differences in 
terms of minutes of arc. 


The reason for giving differences is related to the rule that he used to compute 
the stated values, in the twelfth stanza of the ganitapada:7® 


TARTETed ated Rdlaaz | 
qaaasaraigeddeana = erie kU 


which translates as follows:29 

12. The firs jyva divided by itself and then diminished by the quotient 
will give the second jyva difference. For computing any other difference, 
[the sum of] all the preceding differences is divided by the firs jyva and the 
quotient is subtracted from the preceding difference. Thus all the remaining 
differences [can be calculated]. 


The interpretation of this statement is straightforward. The quotient of the firs 
Jyva difference, jyaO — jya0 = 225, by itself is 1, which subtracted from 
jya@ gives the second jyva difference: jya26 — jya@ = 224. To compute 
any other difference, jya(n + 1)0 — jyan6é,n =1,2,... , 23, the sum of all 
the preceding differences, 


n 


> Liyake — jya(k — 16] = jyand, 
k=1 


27 The introduction includes ten aphorisms in the gitika meter, called dasagitika-sutra, 
which Aryabhata had previously written as an independent tract. The stanza quoted here 
is the tenth stanza in the dasagitika-sutra, and is numbered in this manner in Clark, The 
Aryabhatiya of Aryabhata, 1930, p. 19. 

28 It shows the use of the word sapg = = (ya), also spelled sfaq (jiva) in stanza 23 of 
the Golapada, the fourth part of the Aryabhatiya. The Sanskrit version reproduced here is 
from Shukla and Sarma, Aryabhatiya of Aryabhata, p. 51. 

29 From the available translations I have chosen the one given by Bibhutibhushan Datta 
and Avadesh Narayan Singh in History of Hindu Mathematics, Part III. This part remains 
unpublished, but this translation is reproduced in Shukla and Sarma, Aryabhatiya of Aryab- 
hata, p. 52. 
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is divided by the firs jyva and the quotient is subtracted from the preceding 
difference: 


J 6 
jya(n + 1)0 — jyané = jyané — jya(n — 1)0 a . 
jyaé 
That is, 
: : ; jyané 
Jya(n + 1)6 = 2jyandé — jya(n — 1)0 - : 
jyaé 


Taking n = 2 as an example to fin the jyva of 11 a we have 


iya 20 449 
Jy# iY = 898 — 225 =63—2= 671, 


jya36 =2jya20 — jyad 
Jya Jya Jya 35 


where the quotient on the right has been rounded to the nearest whole number. 

Aryabhata’s rule is different from the one in the Surya Siddhanta only in 
appearance. Replacing n with n + 1 in the Surya Siddhanta rule on page 29 
yields 


"\ jyako 
jya(n + 1)6 = jyand + jyad = se 
fa jyaé 
n= 1,...,23. Subtracting now the firs form from this one, we have 
jyané 


jya (n+ 1)6 — jyan@ = jyané — jya(n — 1)0 - 
jyaé 
which is Aryabhata’s rule. 

This rule, or its simplifie form, is easier to handle than the one in the 
Surya Siddhanta because it does not have a long sum on the right. We assume 
that the rule is only approximate, but may reasonably ask whether the last term 
on the right can be replaced by another term T (@), to be determined, such that 


jya(n + 1)O = 2 jyané — jya(n — 1)6 — T(8) 


exactly. This will be a simpler task if we switch from jyvas to sines using the 
equation jyaa = Rsina, where a is any angle between 0° and 90° and R is 
the length of the radius given in minutes. Then we can rewrite the previous 
equation as 


T (0) 


sin(n + 1)@ = 2sinné — sin(n — 1)0 z 
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Using the formulas for the sine of a sum and a difference of two angles— 
already implicit in Ptolemy’s work for chords, and quite possibly known to 
the Hindus for jyvas—and simplifying reduces this equation to 


T (0) 
—_ 


2(cos@ — 1) sinné = — 
or 
T(9) = 2R(1 — cos) sinn@ = 2(1 — cos 6) jyand. 


Using a hand-held calculator to fin cos 6 = cos 3.75°, we obtain 
T (8) = 0.004282153 jyand, 


and when this is compared with the term 


yan = or = 0.004444444 jyand, 
iya 


it follows that ; 
jyané 
jyaé 
Thus we have found the exact formula for the computation of jyva differences 
and have shown to what extent the Hindu formula is an approximation. 

The Hindu astronomers of the fift century could not have given an exact 
formula for T(9) because the cosine was unknown to them. However, there 
is another trigonometric length that the author of the Surya Siddhanta could 
have used: the versed half-chord. It is defined with the name utkramajya 
(reverse-order jyva), in Chapter II, in the second part of stanza 22, as follows 
[p. 196]: 


= 1.037899497 T (0). 


22. ... Subtracting these [the jyvas], in reverse order from the half- 
diameter, gives the tabular versed half-chords (utkramajyadrdhapindaka). 


To understand this definition note that ifa = n6 for some n between 0 and 24, 
then to subtract its jyva from the radius in reverse order means to subtract the 
Jyva of (24 — n)@ = 90° — a. For instance, if R is the length of the radius in 
minutes, the utkramajya of 6 = 3 * is 


R— jya (90° se 33°) = 3436 — 34517. 


The values of the utkramajyas are then given in stanzas 23 to 27 of the Surya 
Siddhanta from 7 for a = @ to 3438 for a = 90°. 
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In general, if we denote the utkramajya of a by uyaa, then 
uyaa = R— jya(90° — a) = R— Rsin(90° — a) = R(1 — cosa), 
and, in particular, 
2 jyané uyaé 
R 


Thus, the author of the Surya-Siddhanta missed an opportunity to give the 
following exact equation for the computation of jyvas: 


T@)= 


2] 0 6 
jya(n + 1)9 =2 jyand — jya(n — 0 — 2 = ws haa 
n = 1,...,23, instead of an approximation. Of course this formula would 


not give correct values using the approximations jya@ ~ 225 and uyaO ~ 7 
instead of the closer values 224.8393963 and 7.360479721, which present-day 
technology can produce in an instant. 

The origin of the jyva-producing rule has also been a subject for spec- 
ulation. Of course, the answer is not known because the astronomers who 
used it did not include a derivation or an explanation of its origin. This is all 
Aryabhata had to say about the computation of jyvas in the ganitapada [p. 45]: 


11. Divide a quadrant of the circumference of a circle (into as many parts 
as desired). Then, from (right) triangles and quadrilaterals, one can fin as 
many jyvas of equal arcs as one likes, for any given radius. 


He did not, however, explain how it is done. Several possible explanations 
have been proposed since the nineteenth century, but the truth is unknown.°° 

More than a century after Aryabhata, Brahmagupta (598—670), working 
at Ujjain, incorporated trigonometry in his work Brahmasphuta Siddhanta.*" 
After changing Aryabhata’s value of the radius from 3438 to 3270, he in- 
cluded some interpolation procedures to fin the jyvas of arcs that are closer 
together than 225’. His work may have had a profound influenc on the de- 
velopment of trigonometry, since it was later studied by astronomers outside 
of India. Another mathematical astronomer, called Bhaskara (c. 600—c. 680), 
gave an algebraic formula, in a work called Mahabhaskariya, to compute an 
approximation to the jyva of an arbitrary angle that does not rely on the 225’ 
differences [pp. 207—208 of the included translation]: 


30 The explanation proposed by the French astronomer Delambre (1749-1822) in Histoire 
de l’astronomie ancienne, I, 1815, pp. 457—458 deserves consideration. 

31 The word sphuta means “corrected.” Thus, this is the corrected Brahma Siddhanta, an 
earlier work, now lost. See Burgess, “Surya-Siddhanta. A text-book of Hindu astronomy,” 
pp. 419 and 421. 
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Subtract the degrees of the bhuja [arc] from the degrees of the half-circle. 
Then multiply the remainder by the degrees of the bhuja and put down the 
result at two places. At one place subtract the result from 40500. By one- 
fourth of the remainder [thus obtained] divide the result at the other place 
as multiplied by the antyaphala [radius]. Thus is obtained the [jyva to that 
radius]. 


In other words, if the angle in question is denoted by 6, then 


; RO(180 — 0) 
jyad = 4 7 
7140,500 — 6(180 — @)] 


Bhaskara did not bother to tell us how he obtained this formula, but it provides 
a good approximation. 


1.4 TRIGONOMETRY IN THE ISLAMIC WORLD 


The Hellenic civilization of Egypt slowly declined through the centuries, the 
Roman Empire itself was gone with the wind of the barbarian invasions, plung- 
ing western Europe in the dark ages, and only the Byzantine Empire in the 
east managed to hold on, although without much luster. But the void would 
be fille soon, after Mohammed ibn Abdallah (c. 570-632) founded in 612 
a new religion, Islam (meaning “submission” to Allah). Its adherents, called 
Muslims, set out to the conquest of infide lands shortly after the death of 
Mohammed. It was thus that a new empire was born, and by 642 it extended 
from northern Africa to Persia. 

This success notwithstanding, there was disagreement over the matter of 
Mohammed’s successor or caliph (from khalaf, to succeed) from the very firs 
moment after his death. Mohammed’s cousin and son-in-law, Ali ibn Abu 
Talib, thought he was the intended successor, but a group of Muslim leaders 
chose Mohammed’s father-in-law to become the firs caliph. Eventually, Ali 
managed to become the fourth caliph in 656, on the assassination of Uth- 
man ibn Affan, the third caliph, by rebel factions. In turn, members of the 
Umayyad family, to which Uthman belonged, ousted Ali—who would end up 
assassinated by some of his ex-followers—and proclaimed one of their own, 
Muawiya ibn Abu Sufyan, as caliph in 661.94 


32 The followers of Ali, or Shi’at ul-Ali, became known later as Shi’ites. Those who 
believed that the prophet had left the choice of successors entirely to the people were known 
as Sunnis. 
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Biruni 
Samarkand 
e Harran Marw 
Tus ° 
<7 » Samarra © Buzjan 


* Damascus * Baghdad 


e Medina 


The Umayyads, who turned the caliphate into a dynasty and moved their 
capital from Medina (more fully, al-Madinah an Nabi, meaning “the City of 
the Prophet”) to Damascus, greatly expanded the Islamic empire, building an 
efficien government structure in the process, with a large number of Christian 
Byzantine administrators. By the time the Muslims were checked at Poitiers, 
near Tours, by Charles, natural son of Pepin of Herstal, and his Franks in 
732—Charles would be called Martel (the Hammer) after this victory—they 
dominated a large portion of the world, from Hispania in the west to the Indus 
river in the east. The court structure that they built may have clashed with the 
simplicity of early Islam, but it was a source of culture in the arts and sciences. 

Arts and sciences notwithstanding, the Umayyads were not popular in some 
quarters, definitel not in the mind of those who thought they had betrayed 
the spirit of Islam. A revolt led by Abu al-Abbas (722—754), a descendant 
of a paternal uncle of Mohammed, ended the Umayyad dynasty in Syria with 
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the massacre of most of the Umayyads on June 25, 750. This earned Abu 
al-Abbas the name extension al-Saffah (the Blood-shedder). 

Abu Jafar Abdullah ibn Mohammed al-Mansur (the Victorious) (712—775), 
the second caliph of the Abbasid dynasty, as it would be known, founded the 
city of Baghdad (or “God-given” in Middle Persian) on July 30, 762, on the 
recommendation of the court astrologer, and transferred the capital there. This 
was a city built in the shape of a circle, surrounded by a moat, and with the 
mosque in the center. Here Al-Mansur established a program of translation 
from foreign languages, with a large mathematical and astronomical com- 
ponent, that lasted for well over two hundred years and was the foundation 
of much scholarly work in the caliphate.°° The historian Abu’l Hasan Ali 
ibn Husayn ibn Ali al-Masudi (c. 895-957) stated that al-Mansur sponsored 
Arabic translations of Ptolemy’s Almagest, the Arithmetike eisagoge of Nico- 
machos of Gerasa, and Euclid’s Elements (not extant).°4 There was also a 
work referred to as the Sindhind that had been brought from the land of Sind 
(modern Pakistan) to Baghdad about 766 by an Indian scholar named Kanka. 
It has been suggested that this work may have been the Surya Siddhanta, 
but it is considered more likely that it was the Brahmasphuta Siddhanta of 
Brahmagupta. It was through a translation of this document, commissioned 
by al-Mansur and made by Mohammed ibn Ibrahim al-Fazari, that Indian 
astronomy and mathematics became known in the Islamic empire. 

Under Harun al-Rashid (the Upright) (764-809), who became the fift 
Abbasid caliph in 786, the court of Baghdad reached a peak of splendor, 
and he himself is known as the main character in the The thousand and one 
nights. He commissioned a translation of Euclid’s Elements, prepared by al- 
Hajjaj ibn Yusuf ibn Matar, a manuscript copy of which still exists. After a 
disastrous civil war, it was with Harun’s son Abdullah al-Mamun ibn Harun 
abu Jafar (786—833), the seventh of the Abbasid caliphs, who welcomed all 
kind of scholars to his court, that intellectual life flourished In mathematics, 


33 Al-Mansur initiated this activity for political reasons. When Alexander the Great con- 
quered Persia and killed king Darius III, he had all the documents in the archives of Istahr 
(Persepolis) translated into Greek and Coptic, and then destroyed the originals. The neo- 
Persian empire of the Sasanians (225—651) then attempted to recover their cultural heritage 
through a program oftranslations from the Greek into Pahlavi (Middle Persian). Al-Mansur’s 
decision to continue the translation program was part of his policy to adopt and incorporate 
the ideology of the large Persian component of the population in this part of the Islamic 
empire. For a complete account of the translation movement see Gutas, Greek thought, 
Arabic culture, 1998. 

34 Tn his book Muruj al-zahab wa al-maadin al-jawahir (Meadows of gold and mines of 
gems), 947, §3458 
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not only were additional translations of the Elements and the Almagest made 
during Al-Mamum’s reign, but also significan original work was produced, 
such as the treatise Kitab al-jabr wa’l-mugqabala (Treatise of restoring and 
balancing), dedicated to al-Mamun by Mohammed ibn Musa al-Khwarizmi 
(c. 780-850), the greatest mathematician of medieval Islam.*° It was on the 
translation of Kanka’s Siddhanta that al-Khwarizmi based the construction of 
his astronomical tables. 

As for trigonometry, it was still subservient to astronomy in the Islamic 
world, mainly through the elaboration of tables of half-chords and other 
lengths. For this is one of the main contributions in Arabic to the development 
of trigonometry: the introduction of all six trigonometric lengths (they were 
still lengths and not functions as they are today). The Hindu jyva was adopted 
over the Greek chord, but since there is no v sound in Arabic it became jyba 
or simply jyb (Cu). The cosine of an arc smaller than 90° was known and 
used as the jyba of the complement of the arc. In this way it was used by 
the famous astronomer Mohammed ibn Jabir al-Battani al-Harrani (c. 858— 
929)—-working in Samarra, now in modern Iraq—in determining the altitude 
of the sun. It appears, together with a table of its values, in his book On the 
motion of the stars, written about 920. What we now know as the tangent 
and cotangent appeared firs as shadows of the gnomon, or miqyas (_ je) in 
Arabic, as shown in the next figure 


migyas 
Turned shadow 


miqyas 


Straight shadow 


A table of sines and tangents—to use the modern terms—was compiled by 
Ahmad ibn Abdallah al-Marwazi al-Baghdadi (c. 770—870), a native of Marw, 
in present-day Turkmenistan. Known as Habash al-Hasib (the Calculator), 


35 Tt should be mentioned, in passing, that the title of this book was the origin of the word 
algebra through its Latin “translation” Liber algebre et almucabola. A work on arithmetic 
was translated as Liber Algoritmi de numero Indorum (Al-Khwarizmi’s book on Hindu 
numbering), and it was thus that the name of the region Khwarazm, around the present-day 
town of Biruni, in Uzbekistan, originated the word algorithm. 
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working in the court of al-Mamun and that of the next caliph Abu Ishaq al- 
Mu’ tasim ibn Harun, he may have computed the firs table of tangents ever. His 
tables were done in intervals of 1° and are accurate to three sexagesimal places 
(parts, minutes, and seconds) as opposed to Ptolemy’s, which are accurate only 
to the minutes. Be all this as it may, al-Hasib did not contribute to trigonometry 
in the usual sense. 

The work of Mohammed ibn Mohammed abu’|-Wafa al-Buzjani (940— 
998) °° had a greater impact on trigonometry. In 945 Baghdad was captured by 
Ahmad Buyeh, of north Persian descent, and the caliph remained a figurehead 
Four years later his son Adud al-Dawlah became caliph and founded the Buyid 
dynasty. He was a great supporter of mathematics, science, and the arts, and 
attracted Abu’1-Wafa to Baghdad, where he wrote several texts in mathematics, 
in 959. In 983 Adud al-Dawlah was succeeded by his son Sharaf al-Dawlah, 
who continued to support the sciences, and Abu’1-Wafa remained in Baghdad. 

In addition to elaborating trigonometric tables in smaller steps and with 
greater accuracy than his predecessors, he started a systematic presentation 
of the theorems and proofs of trigonometry as a separate discipline in math- 
ematics. In Chapter 5 of his Almjsty (_la..-oll), which is not a translation 
of Ptolemy’s Almagest but a separate work probably written after 987,°” he 
considered only the chord, the jyba, and the jyba of the complement, giving 
at once formulas for the jyba of a double arc and half an arc. In Chapter 6 he 
define the tangent or shadow in this manner: *° 


The shadow of an arc is the line [segment] drawn from the end of this arc 
parallel to the jyba, in the interval included between this end of the arc and 
a line drawn from the center of the circle through the other end of the same 
arc. 


36 This may be a good place to explain Arabic names. They usually start with a given 
name, such as Mohammed, followed by ibn (son of) and the name of the father. The 7 in 
ibn is silent, so that this word sounds more like bin, which is another frequent spelling. 
Further names of the grandfather and great-grandfather may be added next. Usually, the 
name ends with the town or region of origin, such as al-Buzjani, which means “the one 
from Buzjan,” located in present-day Iran. Sometimes there is a nickname at the end, such 
as al-Rashid (pronounced ar-Rashid because the / in al tends to acquire the sound of the 
next consonant). In some cases, the person being named had a famous son, whose name is 
added following the word abu (father of). Fate determines how a particular person should 
be known to posterity, and Mohammed ibn Mohammed ibn Yahya ibn Ismail ibn al-Abbas 
abu’1-Wafa al-Buzjani is usually known as Abu’1-Wafa. 

37 For this date see Woepcke, “Recherches sur l’histoire des sciences mathématiques chez 
les orientaux, d’apres des traités inédits arabes et persans. Deuxiéme article,” 1855, p. 256. 

38 The definition quoted here are from Delambre, Histoire de l’astronomie du moyen age, 
1919, p. 157. 
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Abu’l-Wafa’s statement is represented in the figure He called this trigono- 


miqyas 


shadow 


metric line the firs shadow or turned shadow. Then the secant, or 


diameter of the shadow, is the line drawn from the center to the end of the 
shadow. The straight shadow [cotangent] is the shadow of the complement 
of the arc. 


Next Abu’|-Wafa stated, in verbal form, four equations relating these and the 
former trigonometric lines, the firs of which, if R denotes the radius, can be 
written as Zz 

shadow jyb AB 


R jyb (90° — AB) 


This is clear from the similarity of the triangles shown in the previous figure 
Then Abu’l-Wafa chose R = 1 for the following reason [p. 420]:°° 


Thus it is evident that, if we take unity for the radius, the ratio of the jyba of 
an arc to the jyba of its complement is the shadow, and the ratio of the jyba 
of the complement to the jyba of the arc is the level shadow. 


This choice for the radius was a stroke of modernity, which he applied in the 
construction of his sine and tangent tables, but later writers did not adopt it. 

As an example of Abu’l-Wafa’s style in trigonometry, we return to Chap- 
ter 5 of the Almjsty and present his proof of the formulas now written as 


sin(a + #0) = sinacosé + cosasiné. 


He provided two calculations to obtain the sine of a sum or difference, and 
the firs one is as follows [pp. 416—417]: 


39 The remaining quotations from Abu’1-Wafa included here are English translations from 
a French translation of the relevant passages on trigonometry in Carra de Vaux, “L’ Almageste 
d’ Abt’ lwéfa Albtizdjani,” 1892, pp. 416—420. Ihave changed the words “sine” and “cosine” 
to jyba and jyba of the complement, and have mostly used “segment” as a translation of 
ligne. 
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Consider the two arcs AB, BC of a circle ABCD (fig 1) [these arcs are 
represented below in two possible arrangements]. The jybas of each of them 


are known. I say that the jyba of their sum and that of their difference are 
also known [can be found]. Join the three points A, B, C to the center O; 
from the point B drop to the radii OA, OC the perpendiculars BT, BH, and 
join HT. I say that [in the left figure HT is equal to the jyba of AC. In 
fact, prolong BH, BT down to D and Z and join DZ; HT will be equal to 
half of DZ, because the two segments BD, BZ are respectively divided into 
two equal parts at the points H and T. Therefore the arc DBZ is double the 
arc ABC, and the segment HT is equal to the jyba of the are CA. 


To restate this in current notation we use the central angles a = ZAOB and 
6 = /BOC rather than the arcs AB and BC that Abu’l-Wafa used. Then, by 
the bisecting nature of OA and OC, 


/ZOD = /ZOB+ £BOD = 2a + 26 
in the left figure and 
HT = 45DZ = 5crd ZOD = jyb(a+ 6) =sin(a +). 


The last equation is a consequence of the choice R = 1. Then, in order to deal 
with the sine of a difference, Abu’l-Wafa continued as follows [pp. 417—418]: 


In a second figur (fig 2) [the one on the right], the arc BDZ is double 
the arc BCA, and the arc BCD, double the arc BC; as a difference the arc 
DZ is double the arc AC. From the point B drop a perpendicular BN on 
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the line HT. The two angles BHO, BTO being right angles and built on 
the segment BO, the quadrilateral BOTH can be inscribed in a circle [see 
Proposition 31 of Book II of Euclid’s Elements]. The two triangles BOH, 
BTN, are similar since the angles at H and N are right angles, and the angles 
at O and T are equal, because these two angles are built on the segment BH 
[a chord in the circle BOTH]. Therefore the other two angles are equal to 
each other, and we have: 

NB HB. 

Br BO" 
but the lengths BT, BO, BH are known; thus we can infer BN, and the 
angle at N being right, we can obtain the segments NH, NT, and finall 
HT, which we wanted to prove. 


This proves, indeed, that HT can be obtained from the given jybas: BT = 
jyba and BH = jyb@, and the proof stated here is just as valid for the left 
figure But in this case, 


LZOD = LZOB — LBOD = 2a — 20, 
and therefore 


HT = 5DZ = 5crd ZOD = jyb(a — 6) = sin(a — 6). 


This concludes the calculation of HT, that is, the calculation of the sine of 
a sum or difference, but it does not give the promised and expected familiar 
formulas. However, Abu’l-Wafa would return to this topic, expressing himself 
in the following manner [p. 418]: 


Let us return to the two figure that we have drawn. The two triangles H NB, 
BOT are similar, because the angles BNH, BTO are right and the angles at 
Hand O are built on the segment BT [intheright figur itis the angles BHT, 
BOT that are built on BT, and since they are inscribed in the circle BOTH 
on opposite sides of the same chord they are supplementary]. Therefore we 
have: 

BH _ BO. 

HN OT’ 
but the lengths BH, BO, OT are known; thus HN will be known. Further- 
more, the two triangles BNT, OBH are also similar, because the two angles 
at H and N are right and the two angles at O and T are built on the segment 
BH. Then we have: 

BT BO. 

TN OH’ 
but the lengths BT, BO and OH are known; thus T N will be known. Hence 
the segment HT will be known which is, as we have already seen, the jyba 
of the are AC. 
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From these equations, and recalling that BO = 1, we immediately obtain 


HN = OT - BH =cosasin6, 
TN = BT - OH =sinacosé. 


Then, in the left figure 

sin(a + 0) = TH = TN + NH =sinacosé + cosa sin@, 
and in the right figur 

sin(a — 0) = TH = TN — NH = sinacos6 — cosasing. 


These are the well-known formulas in today’s notation. Abu’l-Wafa stated 


them in narrative form as follows:*° 


ls $3) Lg cls Cres cprugill spect ne Clem 
dats Js mam lope ls bss} Is! loghae bepie nots is 
Liay! yt Leliag Juan Li Golis oUt ple am & layie 
mae Gayl yl Lg iclis Frets (amyl spot cum 


which translates as 


Calculation of the jyba of the sum of two arcs and the jyba of their difference, 
when each of them is known. We multiply the jyba of each of these arcs by 
the jyba of the complement of the other, expressed in sexagesimal minutes, 
and we add the two products if we want to know the jyba of the sum; we 
subtract them if we seek the jyba of the difference. 


We saw before that the firs of the last two identities can be obtained from 
an interpretation of Corollary 1 of Ptolemy’s theorem. Abu’l-Wafa’s proof, 
by contrast, gives the two formulas in one single move, and, because of his 


40 Reproduced from Carra de Vaux, “L’ Almageste d’ Abi’ lwéfa Albizdjani,” p. 419. 
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choice of the radius, they are given in modern terms at once.*" 


The historian and general scholar Muhammad ibn Ahmad abu’!-Rayhan al- 
Biruni (973-1048), from Khwarazm, in today’s Uzbekistan, wrote an Exhaus- 
tive treatise on shadows about 1021, in which he gave several relationships 
among the various trigonometric lengths. But a comprehensive treatment of 
trigonometry in the Islamic world would appear only in the thirteenth century. 
While, if we were to talk in cinematographic terms, mathematics developed in 
slow motion—very slow motion in the tenth century—empires and dynasties 
evolve, in comparison, like the life of a fl wer in those film in which its birth, 
bloom, and death are viewed in the span of one minute, as if speed had been 
invented for the sole purpose of making mincemeat of history. 

Mincemeat is what Baghdad became in 1258 after being conquered by the 
Mongol leader Hulagu (or Hulegu), for he did not believe in mercy toward the 
defeated. He was the grandson of Temujin, also called Genghis Khan (possibly 
from the Chinese cheng-/i, “successful-lucky,” and the Turkish khan, “lord’’), 
who had managed to unite various Mongol tribes and began a large campaign 
of conquests in 1206. Hulagu expanded the Mongol empire to the west, and 
on his way to Ain Julat, near Nazareth, where he was defeated for the firs 
time, he accepted the surrender of the Hashashin of Alamut in 1256. This was 
a fortress in the mountains of central Iran (aluh amut means “eagle’s nest”) 
where this order of Nizari Ismailis had its center of operations, sending killers 
wherever they were deemed necessary to eliminte enemies and creating terror 
and unrest in the Islamic world.4# 

That this event was important in the development of trigonometry is due to 
the fact that Mohammed ibn Mohammed ibn al-Hasan al-Tusi (1201-1274), 
a gifted astronomer and mathematician and a native of Tus—a town right next 
to the location of modern Meshed in Iran—was at the Alamut fortress at that 
time. Upon its destruction by Hulagu, he joined his forces and was with them 
when they sacked Baghdad. 

Hulagu was interested in science and made al-Tusi his advisor in such 


41 However, Abu’l-Wafa’s proof could have been shortened by using Ptolemy’s theorem: 
Applying it to the quadrilateral BOTH in the left figur gives 


TH.BO=TO.-BH+BT.-HO, 
and in the right figur 
BT-HO=TO-BH+BO.-TH. 
With BO = 1 and the appropriate values of TH this gives the well-known formulas. 
42 Hashashin is a word of uncertain origin, about which much has been written without 
proof, from which the modern word “assassin” derives. For a history of this sect see 
Hodgson, The secret order of assassins, 1955. For the word assassin, pp. 133-137. 
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MOHAMMED AL-TUSI 


matters and also put him in charge of religious affairs.49 In 1262 he completed 
the construction of an observatory in his new capital, Maraga, near Azerbaijan, 
for which Mohammed al-Tusi designed several instruments and where he 
wrote tables and other works. One of these was the Kitab shakl al-qatta 
(Treatise on the transversal figure) in fi e books,** in which he gave the firs 
systematic presentation of trigonometry as independent of astronomy. In this 
treatise he stated and proved the law of sines for plane triangles and used it to 
solve triangles (this is the name for the process of determining the unknown 
angles and sides of a triangle when some are given). The law of sines was 
not entirely new, but not until the sixteenth century was originality one-tenth 
as valued as it is today. Compilation was more the name of the game, so that 
others could have a handy reference when applying a science to the practical 
world. The law of sines, in particular, was already implied in Ptolemy’s work, 
was proved by Abu’l-Wafa for spherical triangles, and was also stated and 
proved by al-Biruni. In fact, al-Tusi used and borrowed from al-Biruni’s 


43 Tt is possible that Mohammed al-Tusi’s connection with religious affairs is the reason 
for the name Nasir al-Din (frequently spelled Nasir Eddin), by which he is very often known. 
It means “Defender of the Faith.” 

ay eles)! JS OLS, an Arabic translation of a book that al-Tusi had originally written 
in Persian a few years before. Page references below are to the French translation by 
Carathéodory, Traité du quadrilatére attribué a Nassiruddin-el Toussy, 1891. 
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treatise The key to the knowledge of spherical figures and other figures while 
writing his own treatise. 

Mohammed al-Tusi ignored Abu’l-Wafa’s choice of a unit radius* and 
returned to R = 60. Then in Chapter 2 of Book III, entitled On the way to 
calculate the sides and the angles of a triangle the ones from the others [p. 66], 
he proposed two ways to do just that, the method of “arcs and chords” and the 
method of “arcs and jybas.” He explained firs the method of arcs and chords, 
starting with a right triangle and considering three subcases. Then he turned 
his attention to arbitrary triangles, dividing his study into four subcases, of 
which we shall need the last two [p. 69]:46 


IlI.— Two sides and one angle are given. ... That if the given angle 
is between the two given sides, as the angle A is between the two sides 


B B 


C OE A Cc A E 


AB AC, lower from B to AC the perpendicular BE’. You will thus have the 
right triangle AE B47 of which we know the side AB and the angle A; we 
get BE, EA, and thus we fall back into one of the preceding cases; that is in 
the case in which BE, CE are known; then from these we shall know BC 
and the angle C, as we have explained. 


In other words, crd2A is obtained from a table of chords, BE from the 
equation 
BE serd 2A 
BA 60_— 


45 Al-Tusi stated that Al-Biruni was the firs to choose unity as the radius [p. 212]. 

46 Since I am translating from a translation, I will do it as literally as possible to mini- 
mize the divergence from the original text. However, I have not been able to avoid some 
punctuation changes. 

47 The French translation says BE C, but the Arabic version has the correct triangle (Gg!) 
on page OY (53). However, the figur for the second triangle is mislabeled in Arabic. 
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EA from the Pythagorean theorem, CE = CA = EA, BC is also obtained 
from the Pythagorean theorem, and crd2C is found from the equation 


BE serd 2C 
BC 60 
Then 2C is found from a table of chords, and C is one-half of this value. 


IV.— The three sides of the triangle ABC are given.48 We would calculate 


A A 


C E OB E C B 


the perpendicular according to the common rule; taking the excess of the 
two squares of BA, BC over the square of AC, which we shall divide by the 
double of BC; the quotient will be BE 49 and then the root of the excess of 
the square of AB over the square of BE will give the perpendicular. Thus 
we shall have two right triangles whose angles can be determined, by means 
of which those of the triangle ABC can then be determined. 


This concludes al-Tusi’s presentation of the method of arcs and chords, 
and then, switching to the method of arcs and jybas, he stated the following 


[p. 70]: 


48 Note that the labeling of the vertices in the new figur for this case is different from the 
labeling in Case III. 

49 No proof of this fact is provided because this is “common” knowledge. However, for 
those without that knowledge it can be very simply proved if we denote the sides opposite 
the angles A, B, and C by a, b, and c, respectively. Then, if the perpendicular AE is 
denoted by h and if we put x = BE, we have c* = x* +h? and 


be =(a—xyP +f? =a? —2ax 42x? +h? =a? —2ax4+c’. 


Solving for x gives 
e+e-b 
2a 


x= 


which is al-Tusi’s statement. 
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As for that of the arcs and the jybas, the fundamental notion is that the ratio 
of the sides is equal to the ratio of the jybas of the angles opposite those sides. 


Next he restated it in a more formal manner, which we now call the law of 
sines. 


Let there be a triangle ABC, I say that the segment AB over the segment AC 
equals [the jyba of] the angle ACB over [the jyba of] the angle ABC.°° 


Sol & hl Gat a gl yaniTaue 21 clos! ail Choa nib 9a SI 
ghee dang SA Bess s~ pai! Noe cle alley 


wer 9 = JO oe 9 oy Cas de lb ol UIE a aes 
1 OS de ei 9ee C oral =o cA Lely oN ols 
Shox bor THs b de Val INS cach sie 
UI agf =o dae del oy y Ean) ali er oe teu 
ideal cols Sack at dl ial Bead ee sash dest alt 4 \2ils 
HeiT 1 NT aed OSG Coe HOt skh wld, Se Jl ball 
Per Bar | GPO a ee Sage oo | Oe Re | UO | Co | PR 
Wo Nbelss SO) wylgue Sob MN Sel we algce oe Beil 


Al-Tusi’s law of sines and his firs proof. 
From page 54 (0) of the Arabic translation of the Persian original, 
bound back to back with the French translation by Carathéodory: 
Traité du quadrilatére attribué a Nassiruddin-el Toussy. 


To prove it he considered three cases [p. 70], in which the angle at B (W) is 
obtuse, right, or acute. The firs and the third cases are illustrated in the next 
figure and, following al-Tusi’s lead, we shall leave the case of the right angle 
at B to the reader. 


50 Strangely enough, al-Tusi did not use the word jyba (C4>) in this statement, although 
it appears prominently in the preliminary statement and in the proof. 
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alta 


FEK H HK 


Demonstration.*" Prolong CB to CE = 60 [this may involve a contrac- 
tion rather than a prolongation, but the argument is the same]. From C with 
a radius equal to CE describe ED and prolong CA until it meets this arc 
in D. From D lower the perpendicular DF over CE, that will be the jyba 
of the angle ACB. Prolong BC equally to BH = 60. From B with a radius 
= BH describe HT which will be cut at point T, by the prolongation of 
AB. Lower the perpendicular TK, this will be the jyba of the angle ABC. 
From A lower AL perpendicular over BC, and because of the similarity of 
the triangles ABL, TBK you will have: 


AB _ TB (radius) 


AL TK 
in the same manner because of the similarity of the triangles ALC, DFC, 
AL DF 


AC DC (tadius)’ 


from which by the established proportion [multiplying the two preceding 
equations]: 
AB _ DF(jybAC B) 


AC TK(jybABC). 


Q.E.D.52 


Next al-Tusi gave a second proof, presumably his own, that is shorter than 
the first It is based on the next two figures which represent the two cases 
in which the angle at B of the basic triangle ABC is either obtuse or acute 
[p. 71]. 


51 This is al-Biruni’s original proof, which can be seen translated into English and in current 
notation in Zeller, The development of trigonometry from Regiomontanus to Pitiscus, 1946, 
p. 9. A different translation of this proof and the preceding statements of the law of sines 
can be found in Berggren, “Mathematics in medieval Islam,” in The mathematics of Egypt, 
Mesopotamia, China, India, and Islam, 2007, p. 642. 

52 T have used, and will continue using, modern fractions rather than the symbol : which 
the French translation uses for division. The stated equations are just part of the narrative 
in both the French and the Arabic versions. 
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H 


Lower AE perpendicular to BC; prolong AB, AC just so that AF = AD = 
60 = R. Describe the arc DH. Bring DT, F K perpendicular onto AH. In 
the triangle ABE the angle E being a right angle, B will be the complement 
of A; DT is the jyba of the angle A; TA is the jyba of the angle B. Similarly 
in the triangle AEC, the angle C being the complement of the angle A 
KF is the jyba of the angle A; KA is the jyba of the angle C. And because 
of the similarity of the two triangles ABE, ADT; 


AB _ AD (radius) | 
AE AT (jybB)’ 


in the same manner, due to the similarity of the two triangles AEC, AK F, 


AE _ AK (jybC) 
AC _ AF (radius) 


which gives us [multiplying the last two equations] 


AB _ KA(jybC) 
AC AT(jybB) 


Q.E.D. 


Then al-Tusi used this theorem to solve triangles, starting with the case of 
right triangles [pp. 71-72]. 


Thus if we are dealing with a right triangle, if we know its sides, by the 
method of the jybas [the law of sines], the hypotenuse will be to one of the 
sides in the ratio of the radius [= jyb 90°] to the jyba of the angle opposite 
that side, and through the jyba one find the angle. And if that which is given 
is one angle and one side we shall know from those the angles of the right 


53 The French translation reads A + C = arc de demi-circonférence. 
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Al-Tusi’s second proof of the law of sines. 
From page 55 (00) of the Arabic translation of the Persian original, 
bound back to back with the French translation by Carathéodory: 
Traité du quadrilatére attribué a Nassiruddin-el Toussy. 


triangle, and the ratio of the jyba of the angle opposite the known side to the 
Jyba of the other angle will be equal to the ratio of the given side to the other 
side. Through that the sides will be known. 


While al-Tusi dealt with arbitrary triangles in one single condensed para- 
graph [p. 72], we shall interrupt it three times to insert explanations using 
today’s mathematical writing. To this end, consider an arbitrary triangle ABC 
and denote the sides opposite the angles A, B, and C bya, b, and c, respectively. 


As for the other triangles, if one knows two angles and one side, the other two 
sides will be found according to what we have said about the right triangle. 
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That is, if A and B are the given angles, then C = 180° — (A + B) is found. 
Now assume that the known side is a. According to the law of sines, 

b _ jybB c  jybc 

a jybA a jybA 
In each equation three of the four quantities are known, and then b and c can 
be found. 


If what is given is two sides and one angle (not between the sides) the ratio of 
the side opposite the known angle to the other side will be equal to the ratio 
of the jyba of the known angle to the jyba of the angle opposite the other side. 
And when you have known the angles you would also know the remaining 
side. 


That is, assume that A, a, and b are the given items. Then B is determined 
from ; 

b  jybB 
a jybA 
Now two angles and one side are given, and the triangle is solved as in the 
previous case. 


If the angle is between the two given sides one would proceed as it has been 
said. 


Now, this is really a short explanation, but al-Tusi is right. This is just Case II 
of his method of arcs and chords, shown on page 45. Then he concludes with 
the last of the present cases: 


If the given consists of the three sides, fin firs the perpendicular as it has 
been said, after which you will be able to know the angles as you would fin 
them in the case of a right triangle. 


We need to add to this explanation by saying that in Case IV of his solution 
of triangles by the method of arcs and chords (page 46), and with reference to 
the figur reproduced there, al-Tusi had already evaluated BE and AE. Then 
EC is easily obtained from BC and BE, so that in each of the right triangles 
ABE and AEC two sides and the right angle between them are known, a case 
that has already been covered. 

Then al-Tusi concluded Chapter 2 with the following words: “Here ends 
what we were going to say about triangles.” But this is not the end of his 
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application of the law of sines, for in Chapter 3 of Book II, entitled Rules 
whose knowledge is very useful in the theory of the plane quadrilateral, he 
also used it to prove several of these rules or lemmas. Of these we present 
only the firs as a sample [p. 73]. 


Let there be ina circle ABC, two arcs AB, AC whose ends touch at the point A: 
their sum BAC < 3 the circumference is given, as well as the ratio of jyb AB 
over jyb AC. I say that AB, AC can be determined. 


ioe 
e-—\ 


Demonstration.>4 Draw the chord BC and the diameter AE which intersect 
in D. Drop from the center, FH perpendicular over CB, join BF’. The arc 
BAC being known the chord BC is known too [from a table of chords]; 


jybAB 
jyb AC 


being known [the ratio] DB/DC ... is also known.°° 


It might be appropriate to interrupt the proof to show why this is true. Visualize 
two perpendiculars from B and from C to AE and denote their intersections 
with AE by B, and C,, respectively. Note that BB, = jyb AFB = jybAB 
and that CC, = jyb ZAFC = jyb AC. This and the similarity of the triangles 


54 A different translation of this lemma and its proof can be found in Berggren, “Mathe- 
matics in medieval Islam,’ in The mathematics of Egypt, Mesopotamia, China, India, and 
Islam, p. 643. 

55 The omitted short passages in this quotation and the next one refer to some segments, 
external to the circle in question, that may be relevant in the rest of al-Tusi’s Chapter 3 but 
are not relevant to this proof. 
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BDB, and CDC, gives 


DC _ BB, _ jybAB 


DB CC, jybAC 


which shows that the ratio on the left is also known. Then al-Tusi continued 
as follows: 


We deduce: 
DB+DC=BC 


DB 


... [is known too]. DB, BC, BH (BC/2) and consequently DH and FH 
(jyba of the complement of half the arc BAC 86 are thus known segments 
[DH = BH — DB and FH = 4/60? — BH?]. Now in the right triangle 
DHF the two sides of the right angle are known and consequently the angle 
DFH. Moreover BFH (which corresponds to half of the arc BC) is known. 
Thus we come to determine the angle BFA, which corresponds to the arc 
BA, and hence the arc AB itself, Q.E.D. 


Thus we see that al-Tusi’s treatise prominently displays trigonometry and 
its applications, although the only trigonometric length that he used in the 
work described to this point is the jyba. However, in Chapter 6 of Book V, 
entitled Of the figure called shadow, of its consequences and of its accessories, 
al-Tusi define all six trigonometric lengths and gave some of their properties. 
Before going on to prove some results in spherical trigonometry, he stated the 
following [p. 164]: 


Let there be the circle ABCE with center at D and the arc AB. Draw the 
diameters passing through the points A and B, and from the point A raise the 
perpendicular AF to AC, which will meet the diameter BE at F 57 AF is 
the shadow of the arc AB, parallel to BH which is the jyba. Similarly, raise 
on AC the perpendicular DT through the center. The perpendicular TK will 
be the shadow of the arc TB, while BL is the jyba; in other words, TK and 
BL will be the shadow and the jyba of the complement of the arc AB. What 
we have called shadow, astronomers call the first shadow or turned shadow 
of the arc AB. ... they call TK, the second shadow of the arc AB, or the 
straight shadow; F D is for them the diameter of the Ist shadow, and KD 
the diameter of the second shadow ... 


56 Half the are BAC is equivalent to the angle BFH, and its complement is the angle 
FBH. 
57 For some reason, this point is not labeled in the Arabic manuscript. 
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That makes six trigonometric lengths, that last two of which we now call the 
secant and the cosecant. 

Then al-Tusi stated the following relationships for the new trigonometric 
lengths [pp. 164-165]: 


The firs shadow of every arc is the second shadow of its complement and vice 
versa; the ratio of the shadow to the radius is equal to ratio of the jyba of the 
arc to the jyba of its complement; the ratio of the shadow to the diameter of 
the shadow is equal to ratio of the jyba to the radius, because of the similarity 
of the triangles FAD, BHD; but the triangles AFD, K TD being also similar 
one has 

FA _ DT=AD «. 

AD TK ° 
hence the radius is the mean proportional between the shadow of an arc 
and the shadow of its complement, and the shadows of two arcs are as the 
reciprocals of the shadows of their complements. 


In other words, we have some familiar trigonometric identities, starting with 


Rtana = Rcot(90° — a) 


and ending with 
tana coté 


tand cota 
While some of these identities may be considered unexciting today, the fact 
is that the introduction of the six trigonometric functions and the relationships 
between them gave an agility to the applications of trigonometry that it did 
not have in the times of just arcs and chords or arcs and jybas. 


58 Tn the French translation the last denominator appears as KD, which contradicts the 
text following this equation, but it is correct in the Arabic version on page \YY (127). 
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After this, little progress was made in trigonometry in the Islamic world 
except in the elaboration of fine tables. Worthy of mention are those com- 
pleted by Ghiyath al-Din Jamshid al-Kashi (of Kashan) in 1414 as a revision 
of earlier ones by Mohammed al-Tusi. They were dedicated to the Great Khan 
Ulug Beg, a generous patron of the arts and sciences, whose capital was at 
Samarkand. Of great merit is al-Kashi’s interpolation algorithm based on the 
approximate solution of a cubic equation.°9 


1.5 TRIGONOMETRY IN EUROPE 


Abd al-Rahman, one of the Umayyads to escape the massacre by Abu al- 
Abbas al-Saffah in 750, managed to reach the Iberian peninsula. In 756 he 
proclaimed himself emir in Cordoba and in 773 he became independent of 
Baghdad. Abd al-Rahman III, also of the Umayyad dynasty in Cordoba, who 
ascended to the throne in 912, turned the emirate into an independent caliphate 
in 929. Islamic culture had already started to flouris under the emir Abd al- 
Rahman IJ. When in the ninth century the largest library in Christian Europe, 
at the Swiss monastery of Saint Gallen, had a grand total of 36 volumes, that 
of Cordoba housed 500,000, one per inhabitant. 

Abd al-Rahman III was determined in his resolution that the splendor of 
his court at Cordoba would outshine that of the eastern Abbasid caliphate. 
He brought artists and scientists to Cordoba, and it was in this way that a 
large part of Islamic knowledge reached Europe, to be augmented by some 
of the local scientists. Contributions to trigonometry were only minor, and 
they took place after the decomposition of the Cordoba caliphate into smaller 
independent kingdoms. But because of their later influenc we should cite the 
Zij tulaytulah (Toledan tables) of Abu Ishaq Ibrahim ibn Yahya al-Naqqas (the 
Chiseler) al-Zargel (the Blue-eyed, referring to his father) (1029-1100), a 
practical astronomer who made observations in Toledo and devoted several 
pages to the use of the cotangent. Abu Mohammed Jabir ibn Afla al-Ishbili 
(from Seville) (c. 1100—c. 1160), frequently known as Geber, the Latinized 
form of his name, wrote the Islah al-mjsty or Correction of the Almagest, 
which would be influentia some centuries later. 

But the accumulation of knowledge in Muslim Spain would have been 


59 For details see Berggren, Episodes in the mathematics of medieval Islam, 1986, pp. 151— 
154. 

80 Many historians use the form al-Zarqali, but this cannot be considered correct for the 
reasons given by Millas Vallicrosa in Estudios sobre Azarquiel, 1943, pp. 13 and 21—22. 
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useless if such a wealth of books had remained unread by future generations. 
However, in 1085 Alfonso VI of Castile, the Valiant, conquered Toledo by the 
civilized method of convincing King Ismail ibn Yahya al-Qadir that resistance 
was futile, and entering into an agreement for a peaceful take-over. Less than 
a century later, the archbishop Raimundo de Sauvetat was pondering what to 
do with so many old volumes at the Cathedral of Toledo, and decided to make 
them available to scholars. This was the origin of the Escuela de Traductores 
de Toledo (Toledo School of Translators), which became one of the main 
avenues for the dissemination of old knowledge to Europe. 

Many scholars floc ed to Toledo to translate those works, among them 
Adelard of Bath (1075—1160), who translated the astronomical tables of al- 
Khwarizmi—trevised by Maslama ibn Ahmed al-Madjriti (of Madrid) in the 
tenth century—in 1126 and Euclid’s Elements in 1142. In 1145 Robert of 
Chester (1110-1160) translated al-Khwarizmi’s Kitab al-mughtasar fi hisab 
al-jabr wa’l-muqabala (The compendious treatise on calculation by restora- 
tion and reduction) with the title Liber algebre et almucabala, which gave us 
the word algebra. Gherardo de Cremona (11 14—1187) translated more than 80 
texts from Arabic to Latin, including Ptolemy’s A/magest—for which purpose 
he went to Toledo in 1167 and remained there for life—Euclid’s Elements, 
and al-Zarqel’s Toledan tables. Johannes Hispalensis, or Juan de Sevilla (died 
1180), translated al-Khwarizmi’s book on Hindu numbering. 

Many other translators, local and foreign, contributed to this endeavor: 
Petrus Alphonsi (Moshé Sefardi), Rudolf of Bruges, Dominicus Gundisalvi, 
Michael Scot, Marcos de Toledo, Alfred of Sareshel, Plato Tiburtinus also 
known as Plato of Tivoli, Abraham Bar Hiyya, Hugo de Santalla, Hermann 
of Carinthia (Hernan Aleman), Abraham ben Ezra ... the list is endless. This 
translation activity was an essential component of what has been called the 
twelfth-century Renaissance, and it certainly paved the way for the full-fledge 
Renaissance of the fifteent century.®' 

Some of these translations are particularly noteworthy for linguistic rea- 
sons. The translators, unable to make any sense of the word jyb and taking it 
to be jayb, one of whose meanings is bosom or any object similarly shaped, 
translated it as sinus, which has the same meaning in Latin. We know that 
the word appears in Gherardo de Cremona’s translation of al-Zarqel’s Toledan 


81 Although Toledo was the main translation center, other translations were made in 
Barcelona, Leon, Pamplona, Segovia, and Tarazona in Spain; Béziers, Marseilles, Nar- 
bonne, and Toulouse in France; and in the Norman court of Roger II of Sicily. For full 
details see Chapter IX in Haskins, The Renaissance of the twelfth century, 1927. 
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tables, Canones sive regule super tabulas toletanas, as follows: “Sinus cuius 
libet portionis circuli est dimidium corde duplicis portionis illius” (The sine 
of any portion of a circle is one half the chord of double the portion). It is, 
however, possible that the same rendering of jayb into sinus was made earlier 
by other translators. The names of Plato Tiburtinus and Robert of Chester 
have been mentioned in this connection.® 

With this preparatory work, Europe was ready to embrace and improve on 
trigonometry. The translators had rendered the turned shadow and the straight 
shadow into umbra versa and umbra recta, and these trigonometric lengths 
were used in Europe. At Oxford University, John Manduith (c. 1310) used 
them in his Small tract, and Richard of Wallingford (c. 1292-1335) in his 
Quadripartitum de sinibis demonstratis. In Paris, Jean de Linieres (1300— 
1350), a follower of al-Zargel, clearly define the umbra recta and umbra 
versa in his Canones super tabulas primi mobilis, astronomical tables for the 
Paris meridian. 

The stage was set for the firs systematic treatment of trigonometry in 
Europe. It was written by Johannes Miiller (1436-1476), also known as Re- 
giomontanus. A true man of the Renaissance, he adopted the Latin version, 
Regius Mons, of the name of his hometown of K6nigsberg (King’s Mountain), 
becoming Joannes de Regio Monte. Actually, he was born at Unfinden near 
KGnigsberg. He went to the Academy of Vienna as a student, where he met 
Georg Peurbach in 1451 or 1452, and they became like father and son. In 
1461 he was appointed professor of astronomy at the University of Vienna, 
but he did not enjoy this position for long. At some time, Peurbach had in- 
troduced Regiomontanus to Cardinal Basilios Bessarion, who, like Peurbach, 
was interested in a definit ve translation of Ptolemy’s A/magest (previous Latin 
translations of bad Arabic translations were shunned). When Peurbach died 
at 37, in 1461, it was up to Regiomontanus to finis_ the task. 

He and Bessarion left for Italy in search of reliable documents in 1462. 
There Regiomontanus learned Greek and Arabic, finishe Peurbach’s work as 
Epytoma Joannis de Monte Regio in almagestum Ptolemei in 1463, and wrote 
his own compilation of trigonometry, which he finishe in June 1464. The 
title of this work, a systematic treatise on trigonometry, as independent from 
astronomy and even more inclusive than al-Tusi’s, is De triangvlis omnimodis 
(On triangles of all kinds). It consists of fi e books, the firs two of which 


82 From Smith, History of mathematics, Il, p. 616. 
83 The firs by Cajori, A history of mathematics, 1919, p. 105. The second by Boyer, A 
history of mathematics, 1968, p. 278. 
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JOHANNES MULLER 


From Ludwig Bechstein, ed., Zweihundert deutsche Mdnner in Bildnissen 
und Lebensbeschreibungen, Georg Wigands Verlag, Leipzig, 1854. 


are devoted to plane trigonometry and the rest to spherical trigonometry (in 
the fourth book he borrowed heavily from Jabir ibn Afla without giving him 
credit, for which he was to be chastised later by Cardano). It was eventually 
printed in 1533 and reprinted in 1561 and 1967.4 

Book lof De triangvlis contains fifty-s ven propositions, of which numbers 
20 to 57 deal with the solution of triangles. The only trigonometric function 
used in this book is the sine, which appears in Propositions 20, 27, and 28. 
Here are the statement and proof of this last proposition [pp. 66—67]: 


XXVIUL. 


When the ratio of two sides of a right triangle is given, its angles can be 
ascertained. 


84 Hughes, Regiomontanus on triangles, 1967. Page references are to this edition, and, 
except for very minor changes, the translations included here are by Hughes. 


SECTION 1.5 TRIGONOMETRY IN EUROPE 59 


One of the two sides is opposite the right angle or else none is. First, if 
side ab, whose ratio to side ac is known, is opposite the right angle acb, I say 
that the angles of this triangle become known. In fact, ac is the sine of the arc 
of angle abc by the above, provided that ab is the semidiameter; that is, the 
whole sine [sinus totus] of the circle. Therefore, the ratio of the whole sine to 
the sine of angle abc is known. Hence the latter sine can be found, and finall 
the angle abe will be known. However, if the ratio of the two sides be and ac 
is given, then the ratio of their squares will be known. And by addition [of 
unity to the last ratio and simplification the ratio of the sum of the square 
of be plus the square of ac—that sum being the square of ab because of the 
right angle <—to the square of ac will be known, whence the ratio of the lines 
is known. And what remains is as before. 


To rephrase this using current notation, we consider two cases: 


1. If the ratio ab/ac is given and if ab is the semidiameter, the equation 


ab sinus totus 


ac sinus abe 


(where we have written sinus instead of sin as a reminder of the fact that the 
sinus is the length of a segment and not our usual trigonometric function), 
allows us to determine sinus abc and then the angle abc from a table of sines. 


2. If the ratio be/ac is given, then 


ab” be? + ac bc\? 
= 22 = tl, 


ac 


85 As Hughes explains, Regiomontanus was referring to his previous Proposition 20, which 
just stated the definitio of the sine of an angle between 0° and 90° as the sine of its arc. 
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from which the ratio ab/ac becomes known, and then we finis as in case 1. 


After the proof, Regiomontanus included a guide to the “operation” of the 
theorem; that is, computational instructions, starting with: “If one of the two 
sides is opposite the right angle, multiply the smaller term ...” He concluded 
with an example in which the ratio of ab to ac is 9 to 7. This was a typical 
arrangement that he employed throughout the book. 

Book II, which contains an organized treatment of trigonometry in 33 
propositions, starts with the law of sines [pp. 108-109]: 


I. 


In every rectilinear [as opposed to spherical] triangle, the ratio of one side to 
another is as that of the right sine of the angle opposite the one to the right 
sine of the angle opposite the other. 


Regiomontanus explained, with reference to the enclosed figure what the 


o 


ekh l g 


proposition says. Using our terminology, this is 


ab sinus agb 
— = —— and —=-— 
ag sinus abg bg sinus bag 


ab sinus agb 

Then the proof proper begins with the statement that if abg isa right triangle, 
this is [Proposition] 28 above. If not, but if ab and ag are equal, then the angles 
opposite these sides are equal and hence their sines are equal. But if one of 
the sides is longer than the other, say ag is longer than ab, then prolong ba to 
bd so that the length of bd equals that of ag. The he drew the arcs in the figure 
probably because he was following al-Tusi’s firs proof, which has the arcs. 
Drawing perpendiculars ak and dh to the base bg, it is evident that ak is to dh 
as the sinus of angle agb is to the sinus of angle abg. Moreover, in view of the 
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similarity of the triangles abk and dbh, 


ab abs ak ___ sinusagb 


ag «dH sinus abg. 


“which is certain, and that is what the proposition asserted.” 
Asasample of his use of this theorem in the solution of triangles we present 
his seventh proposition [pp. 112-115]. 


VIL. 


If the perimeter of a triangle is given with two of its angles, any one of its sides 
will be known. 


Clearly, if two angles are known then all three are known, and (a shorter 
proof will be given after the quotation) 


therefore, by the reasoning often cited [the law of sines], the ratio of ab to ag 
will be known, and then by addition [of 1] the sum of ba plus ag will have 
a known ratio to line ag. Similarly, the ratio of ag to gb will be known, and 
the ratio of ba plus ag to gb will be found [as the product of the two previous 
ratios]. Thus, by addition [of 1] the total perimeter of triangle abg has a 
known ratio to line bg. And since the hypothesis gave the perimeter itself, 
line bg will be known. Hence the other two sides will also be declared known. 


Or, more simply, since the ratio of any two sides is known by the law of sines, 
then 
ab + bg + ga 
ab 


is known, so that line bg will be known. Similarly for the other two sides. 
Two features of this second book are remarkable. The firs is the use of al- 
gebra, specificall quadratic equations, in the proofs of Propositions 12 and 25. 
The second is the fact that Regiomontanus implicitly used the trigonometric 
formula for the area of a triangle in the proof of Proposition 26, reproduced 
next. This book was extremely influentia in Europe and highly praised by 
Tyge Brage (a name that he Latinized as Tycho Brahe),® a Danish astronomer 


86 This spelling of this name is from what appears to be his signature (but the fourth letter 
of his last name is less than clear). It is reproduced below his portrait by Hans Peter Hansen 
in Frederik Reinhold Friis, Tyge Brahe: en historisk fremstilling. Efter trykte og utrykte 
kilder, Gyldendalske Boghandel, Copenhagen, 1871. 
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xXVIL 
Dataareatrianguli cum eo,quod fub duobus|ateribus continetur 
retangulo,angulus quem bafis refpicit,aut cognitus emerget, aut cu 
angulo cognito duobus rectis aquipollebit, 
a . Refumptis figurationibus precedetis, 


fi perpendicufaris b k uerfuslined a g p 
cedens,extra triangulum ceciderit,erit per 


ea, que in precedenti commemorauimus, 
proportio b k ad b a nota,&ideo per 
1 primi huius angulum b a k notum acciz 
piemus, fic angulus b a g cum angulo b 
4 g 2 knotoduobus rectis equivalebat. Siue 


ro perpendiaularis b k intra triangulum 

ceciderit,quemadmoda in tertia figuratio 

ne precedentis cernitur, erit ut prius a b 

ad b k notahabens proportione , & ideo 

angulus b a kfiue b.a g notus conclude 
tur.Atfi perpendicularis b k coinciderit lateri a b, necefle eft angulum b a g 
fuifle rectum,& ideo cognitum,quod quidem accidit,quando area trianguli pros 
pofiti zquatur aqsotiibanden lateribus eius continetur rectangulo, 


Reproduced from Regiomontanus, De triangvlis omnimodis, p. 58. 


who studied a supernova that became visible in the sixteenth century. He stated 
that he “found the longitude and latitude of this new star with the help of the 
infallible method of the doctrine of triangles,” and that a good part of the 
propositions that he used were from the fourth book of Regiomontanus.®” 

The second edition of De triangvlis contains some additions. There are 
two tables of sines, one based on a sinus totus of 6,000,000 parts and another 
in which it is subdivided into 10,000,000 parts. Those were times in which 
decimal numbers had not been introduced and it was convenient to compute 
the sines as whole numbers. To obtain precision, the radius or sinus totus must 
then be a huge number. A shorter table of only 95 sines was also included 
based on a radius of 600,000,000 parts. In spite of the amount of effort needed 
to compute them, they would soon become obsolete. 

However, both editions of this book represented a setback in one direction: 
that Regiomontanus used only the sine function, thereby limiting the agility 
with which trigonometry could be applied. But he used the umbra versa from 
1465, the year after he finishe De triangvlis. 


87 Tychonis Brahe, Dani De nova et nvllivs evi memoria privs visa stella, iam pridem 
anno a nato Christo 1572. mense Nouembrj primum conspecta, contemplatio mathematica, 
1573, f. B4” = Opera omnia, 1648, p. 357. 
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About 1467 Regiomontanus left Italy and found employment managing the 
manuscript collection of King Matthias I Corvinus of Hungary. In 1467 he 
finishe the Tabule directionum profectionumque, possibly printed privately 
in 1475, which includes 31 astronomical problems but is mostly devoted to 
tables. One of these, which he called tabula fecunda, is a table of tangents. 
It was used in solving his tenth problem, after which he stated that he chose 
the name fecunda because “like a fertile tree it produces many marvelous and 
useful [results].” © 

Notable among those who worked in trigonometry after Regiomontanus 
was the astronomer Mikotaj Kopernik (Latinized as Nicolaus Copernicus) 
(1473-1543). Some trigonometry appeared in his great work De revolvtion- 
ibvs orbium celestium of 1543, but it had been published separately the year 
before under the title De lateribvs et angvlis triangulorum. 

Georg Joachim von Lauchen (1514—1576),’° usually known as Rheticus 
because he was born in the former Roman province of Rhetia, was a young 
professor of mathematics at Wittenberg in 1539 when he resigned his ap- 
pointment to become a student of Copernicus in Poland. He learned that 
Copernicus had trouble completing the demonstrations of his trigonometry, 
and that he even thought of discontinuing his work, but Rheticus encouraged 
him to persevere until he eventually found the necessary demonstrations. 

Rheticus returned to Wittenberg in 1541 to supervise the printing of De 
revolutionibvs, which may never have happened otherwise. He added a table 
of cosines of his own to the De lateribvs, the firs table of cosines ever. From 
Wittenberg he moved to Nuremberg, then to the University of Leipzig, and 
then to Krakow, where he became a practicing doctor of medicine for the next 
twenty years. He still found time for mathematics, and even received funding 
from Emperor Maximilian II, which enabled him to employ six research assis- 
tants at some time. With their help, he elaborated a table of all trigonometric 
lines based on a radius of 10,000,000 in intervals of ten seconds. Then he 
began a table of what we now call tangents and secants based, for extra preci- 
sion, on a radius of 1,00000,00000,00000, but he died in 1576 before he was 
able to finish The tables were completed by his pupil Lucivs Valentinvs Otho 


68 Translated from page 13 of the 1490 edition, which has unnumbered pages. 

89 The plane trigonometry appears in De revolvtionibvs in Chapter 13, pp. 19,-21, with 
minor differences from that in De lateribvs. The tables are in Chapter 12, pp. 12-19, and 
have one fewer place of accuracy than those in De lateribvs. 

70 When his father was executed as a convicted sorcerer, the law forbade his family the use 
of his last name: Iserin. Then Georg Joachim used his Italian mother’s last name de Porris, 
which means “of the leeks.” Subsequently, he translated it into German as “von Lauchen.” 
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6]__ 103551] 
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6a! 1430203 


Pues =) 


Reproduced from page 57 (unnumbered) of Tabule directionu profectionuq|ue] 
famosissimi viri Magistri Joannis Germani de Regiomonte in natiuitatibus 
multum vtiles, 1490. 


(c. 1550-1605) and published in the second volume of the Opvs palatinvm de 
triangvlis in 1596. The firs volume contains four trigonometric treatises, of 
which one is due to Otho. Rheticus is chiefl remembered for his magnificen 
trigonometric tables. In addition to those in the Opvs palatinvm, he began 
a table of sines based on a radius of 1,00000,00000,00000, which was also 
completed by Otho in 1598 and published as the Thesaurus mathematicus. 
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1.6 FROM VIETE TO PITISCUS 


Most notable among the many people who worked on trigonometry in the 
sixteenth century was Francois Viéte (1540—1603), also known as Vieta after 
the Latin form of his name. He was a significan contributor and left us quite a 


FRANCOIS VIETE 
From Smith, Portraits of Eminent Mathematicians, I 


number of well known and not so well known trigonometric identities. Viete 
was a lawyer in the Parlement de Paris and then a personal advisor to kings 
Henri III and Henri IV of France, which did not prevent him from becoming, in 
his spare time, the best mathematician of the sixteenth century. Viéte was the 
firs to understand the generalities underlying the particulars and, in that sense, 
the firs man in the new Europe to really deserve the name mathematician. 
He wrote extensively on trigonometry, starting with his firs published work: 
Canon mathematicvs sev ad triangvla cum adpendicibus (Mathematical table, 
or about triangles, with appendices), written in four books, of which only the 
firs two were ever published. In the firs of these he gave tables of minute- 
by-minute values of all six trigonometric lengths. 
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The next illustration shows the second page of the firs table. In the top 


SEV, AD TRIANGYLA, 
ea Trtanaver, Pramr Raotavevii 


ro, Bt 174} RYU — 
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Second page of the table in Viete’s Canon mathematicvs. 


Reproduced from the virtual exhibition El legado de las matemdticas: 
de Euclides a Newton, los genios a través de sus libros, Sevilla. 


part we read the words Hypotenusa, Basis, and Perpendiculum. These refer to 
a right triangle one of whose legs is horizontal, which he called the base, and 
the other vertical, which he called the perpendicular. These names already 
appear in the works of Rheticus, who used perpendiculum for what we now 
call the sine and the tangent, basis for the cosine, and hypotenusa for the 
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secant. Below we read canone sinuum and canone feecundo. They mean sine 
table and tangent table. He may have taken this name for the tangent from the 
tabula fecunda of Regiomontanus. 

But we have stated that he used all six trigonometric lengths. He referred 
to the cosine, the cotangent, and the cosecant as the sine, the tangent, and the 
secant of the rest (the “rest” meaning the complement, or 90° minus the origi- 
nal acute angle). In Latin, “of the rest” is residue, and thus fecundus residue 
means cotangent and hypotenvsam residue means cosecant. The second book 
of the Canon mathematicvs, individually titled Vniversalivm inspectionvm ad 
canonem mathematicvm, liber singularis (General examination of the math- 
ematical tables, single book), is devoted to the solution of triangles from a 
number of trigonometric formulas, such as the following [p. 11]: 


DIFFERENTIA inter Feecundum & Hypotenvsam Residue, est Feecundus Dimidize 
peripheriz. 


If we accept that the qualifie Residue applies to both Facundum & Hy- 
potenvsam, and if peripheriew, meaning arc, is denoted by 0, this statement is 
equivalent to the trigonometric identity 


coté@ —cscé = tan 50. 


The next work of Viéte’s containing a result that can be considered trigono- 
metric is Ad logisticem speciosam, note priores.”' In the last section of this 
paper, On the genesis of triangles, we fin the following result [p. 72]: 


PROPOSITION XLVIII. 


From two equal and equiangular right triangles, to construct a third right 
triangle. 


First the notation is established as follows [p. 72]: 


Let there be two right triangles with these common sides: A the hypotenuse, 
B the perpendicular, and D the base. 


™ English translation by Witmer as “Preliminary notes on symbolic logistic?’ in The 
analytic art by Francois Viete, 1983, pp. 33-82. Page references are to and quotations from 
this translation. 
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36 Av Locisrircen Spec. 

fis, equatut quadrato adgregati ex bale primi, 8 perpendiculo fecundi, plus quadrato 
differentia oe epernonda primi, ty bafin fecandi vel etiam, ab adra~ 
to adgtegati ex perpendiculo primi & bafe (ecundi , plus quadrato differentia: inter 
bafin primi & perpendiculum fecundi. 


Prorosrtro XLVIII, 


A Duobus triangulis rectangulis xqualibus & xquiangulis , rertium 
triangulumreangulum,conftituere, 


Suntoduo triangala rectangula,quoramcommunialatera hyporenula quidem A, 
petpendiculum B, baGisD. Oporteat abillis duobus tertiam criangulam rectangolum 
conftituere. Fiardeductio ficut docuit Propofitio 4¢.cafu primo. Deducienim tantum 
potelt (ynarefcos via ,non aurem dizrefcos. Fit hypotenufa fimilis A quadraco.Balis,D 
quadrato ==B quadraro.Perpendiculum Bin D 2, Tertium autem illud,vocetuctriangu- 
lum angulidupli, 5¢ ejusrefpectu primum vel fecundum dicetur anguli {impli ob cau- 
fas * {uo ponendas loco. 

* Cauffa oH quod angulus acutus trianguli re Triangulum anguli dupli. 

Etanguli a duobus pe, via {ynarefeos effetti a- pel —— 
quetur angulis acutio horum trianguloram fimul 

adgregatis , cujue theortmatit converfim demon- 

Fravit Anderfonus theoremare fécundo {eétionum 

angularium. Porto acuti voce inteligitor i angulue, DinB2 

cui Perpendiculum fubtenditur. AT. 


Provositio XLIX, 
A Triangulo re@angulo fimpli, 8& DI BY. 
triangulo rectangulo anguli du- 
pli , criangulum redangulum effingere. Vocerur aurem rertium illud, 
triangulum anguli tripli. 


Neither the statement of the proposition nor the notation clarifie what the task 
really is, but from explanations previously provided by Viéte and a look at the 
figur that he provided, shown above, the third rectangle is to be constructed 
such that its hypotenuse is A? (Aq. in Viéte’s notation, as an abbreviation of 
A quadrato, or A square) and the acute angle on the right is twice that of the 
original angle (anguli dupli). 

The proof, based ona previous Proposition 46, is not sufficientl interesting 
for inclusion here, but the result is, and it is contained in the already mentioned 
figure the new base and perpendicular are Dg == Bq and D in B2, meaning 
D*—B? and2BD, respectively. That this is a trigonometric result follows from 
the following interpretation in current notation and terminology. If the angle 
formed by A and D in the original triangle is denoted by 6, then D = Acosé@ 
and B = Asin@. Then, in the new triangle we have 


A’ cos 20 = D* — B? = A*(cos* 6 — sin’ 6) 
and 
A’ sin20 = 2BD = 2A’ sin@ cos@. 


This is the way in which Viéte discovered the double angle formulas, previ- 
ously found by Abu’1-Wafa. 
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In the next three propositions he considered the cases of the triple, quadru- 
ple, and quintuple angles, and from these particular cases, Viéte noticed a 
pattern for the expressions giving the base and the perpendicular (what we 
would now call cos n@ and sinné for n = 3,4,5,if 6 is as define above). He 
stated what the pattern was and realized that it could be continued “in infinit 
progression” [p. 74]. 

Viéte also made this discovery in another paper, Ad angvlarivm sectionvm 
analyticen. Theoremata. xa0odixctepa, possibly written about 1590 but also 
published posthumously, in 1615, by Alexander Anderson.’* Since Viéte was 
more explicit in this second paper, we shall present his general result in this 
context. This work contains ten theorems, and we are interested in the firs 
three. The firs states the following (an explanation in today’s terminology 
follows) [p. 418]: 


THEOREM I. 


If there are three right triangles the acute angle of the first of which differs 
from the acute angle of the second by the acute angle of the third, the first 
being the largest of these, the sides of the third will have these likenesses: 

The hypotenuse will be analogous to the product of the hypotenuses of the 
first and second. 

The perpendicular will be analogous to the product of the perpendicular 
of the first and the base of the second minus the product of the perpendicular 
of the second and the base of the first. 

The base [will be analogous] to the product of the bases of the first and 
second plus the product of their perpendiculars. 


To interpret this theorem refer to the next figure of a half-circle resting 
on a diameter AB, which was used for the demonstration (actually, we have 
omitted two segments and three letters that are not useful for our purposes). 
The first second, and third triangles are ABE, ABD, and ABC, respectively, 
and the corresponding acute angles mentioned in the hypothesis above are 
LEAB, ZDAB, and CAB. The hypothesis itself—in spite of the possible 
confusion created by the word “differs” (differat)—is that 


LEAB = LDAB + LCAB. 


72 English translation of Ad angvlarium by Witmer as “Universal theorems on the analysis 
of angular sections with demonstrations by Alexander Anderson,” in The analytic art by 
Francois Viéte, pp. 418—450. Page references are to and quotations from this translation. 
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It is clear from the firs conclusion that the word “analogous” (simile) cannot 
be taken to mean “equal” for these triangles except in the particular case in 
which AB = 1. If we do this, the equations that were actually proved in the 
demonstration—in narrative form, of course—become 


CB=EBxAD—DBxAE 
and 


CA=AEx AD+ EB x DB 


(otherwise, each of these lengths must be divided by AB). Then, if we interpret 
that the sides AE, AD, and AC are the “bases” of the stated triangles and that 
the sides EB, DB, and CB are the “perpendiculars,” these equations represent 
the last two conclusions of the theorem, and the word “analogous” (simile) 
can be taken to mean “equal.” If we now defin a = ZEAB and @ = DAB 
and if we note that CAB = ZEAB — DAB =a — 0, we see that the firs 
equation is just Abu’l-Wafa’s formula for the sine of w — @ and the second is 
the companion formula for the cosine of a — 6. So this is not new (although it 
probably was to Viéte), and neither is the second theorem, containing equations 
that can be turned into similar formulas for a + 6. 

To present Viéte’s third theorem, which is our goal, we shall consider firs 
a different particular case, that in which D = C, so that ZEAB = 2/CAB. 
But at this point in the original paper there is a change in notation and the 
letters D and B are used to represent the lengths AC and CB, respectively. 
Any subsequent explanations using the letters D and B with two different 
meanings would be confusing. Some sort of compromise is in order, and in 
this we shall follow the lead of a master. When Newton was studying Viéte’s 
paper as a young man of 21 and writing his own understanding of it,’> he used 
lowercase letters for all items previously written in uppercase. Thus, we now 
write D = ad = ac and B = db = cb. Then the two equations of the firs 
theorem’s demonstration can be rewritten as 


B=ebx D—Bxae and D=aex D+ebxB 


73 Whiteside, ed., The mathematical papers of Isaac Newton, I, 1967, p. 78. 
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(for convenience, we continue taking ab = 1, which Newton did not). Then 
solving them for ae and eb and using the fact that D* + B? = 1 gives 


ae = D* — B? and eb=2DB. 


These are then the base and the perpendicular of the triangle abe, whose acute 
angle /eab is double the acute angle /cab. If this angle is denoted by 6, then, 
writing things in present-day notation, D = cos@, B = sin@, ae = cos26, 
and eb = sin 20, so that the preceding equations can be rewritten as 


cos 20 = cos* @ — sin? @ and sin 26 = 2sin@ cos@. 


Next we start a similar procedure, but this time with Zcab = 0, dab = 20, 
and Zeab = 36. Replacing e with d in the equations ae = D* — B? and 
eb = 2DB, we obtain 


ad = D* — B? and db = 2DB, 


in which we still have D = ac and B = cb. Taking these values to the 
equations of Theorem I, they become 


B =eb x (D* — B*) —2DB x ae 
and 
D =e x (D* — B*) + eb x 2DB. 


Solving these equations for ae and eb gives 
ae=D>—3DB* and = eb = 3D°B-— B?, 


which are the base and the perpendicular of the triangle abe, whose acute 
angle Zeab is triple the acute angle ‘cab = @. In present-day trigonometric 
notation, these equations are equivalent to 


cos 30 = cos’ 6 — 3cos@ sin” 6 
and 
sin 30 = 3cos*@sin@ — sin’ 0. 


Viéte did not include the preceding calculations in his paper, just the results, 
including also the cases of the quadruple and quintuple angles. That is, he gave 
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the base and the perpendicular of the triangle abe in the cases /eab = nZcab 
for n = 2,3, 4, and 5, as summed up in the table below. 


Angle Base Perpendicular 
Simple D B 
Double p= zB 2DB 
Triple D> — 3DB? 3D? B — B? 
Quadruple D? =—6D°R? +B" 4D? B —4DB> 
Quintuple Dp? = 10D? B* + 5DR* 5D*B—10D7B? + B° 


It is reproduced in the next page from Viéte’s original publication, on the right 
of the central vertical double line. In this table, g. represents a square, cub. or 
c. acube, gq. a fourth power, gc. afift power, and in indicates multiplication. 

The fact is that the computations involved in solving the equations of 
Theorem I for ae and eb are sufficient! tedious even for these few values 
of n (they were omitted above for n = 4,5), so this procedure cannot be 
comfortably continued indefinitel . But Viéte noticed a pattern in what he 
already had. The terms in each line of the table are those obtained on raising 
the binomial D + B to the nth power: 


(D+ BY = D? +2DB + B’, 

(D+ By =D? 43D BR +3DB" 4B, 

(D+B)* = D*+4D° B+ 6D" B? 4 4DR* + B", 

(+87 =D? +5D°RB +100 B* + 10D*B? +5 DR + Be, 


but different terms are allotted to the base and the perpendicular in a certain 
pattern, with alternating plus and minus signs. 

Generalizing from these fi e cases, Viéte proposed the following conjec- 
ture: if the resulting terms are separated into two groups (the odd terms and 
the even terms), and if the terms in each group are alternately positive and 
negative, the base ae will be the firs group of terms and the perpendicular eb 
the second. For instance, for n = 4, taking firs the odd and then the even 
terms of the expansion of (D + B)* and making them alternately positive and 
negative, we have 


ae = D* — 6D’ B? + B* and eb = 4D°B —4DB°, 
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Reproduced from Viéte, Ad angvlarivm sectionvm analyticen, p. 14 


There are some errors in this table, which were corrected in his Opera mathematica. 


as given in the table. Since the expansion of (D + B)" was well known in 
Viéte’s time for any positive integer n, it provided a general way to evaluate 
ae = cosné@ and eb = sinné from B = cos@ and D = sin@. Here is an 


extract of his very long statement [pp. 422—423]: 


THEOREM III. 


If there are two right triangles the acute angle of the first of which is a fraction 
of the acute angle of the second, the sides of the second are like this: 
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[To find] the analogues of the sides around the right angle correspond- 
ing to the hypotenuse, construct an equally higher power of a binomial root 
composed of the base and perpendicular of the first [triangle] and distribute 
the individual homogeneous products successively into two parts, the first of 
each being positive, the next negative, [and so on]. The base of the second 
triangle becomes the analogue of the first of these parts, the perpendicular 
the analogue of the second. 


There is more of Viéte’s trigonometry starting in Chapter XIX of his Va- 
riorvm de rebvs mathematicis responsorvm, liber VIII (Answers to various 
mathematical things, book VIII).”4 There are quite a few plane trigonometric 
identities in this treatise [pp. 401—403], including of course the law of sines in 
the form Latera sunt similia sinibus, quibus ea subtendentur (The sides are like 
the sines that they subtend) [p. 402]. By this time Viéte had changed some of 
his terminology, referring to the tangent as prosinus rather than fecundus and 
to the cotangent as prosinus complementi (while the hypotenusa had become 
transsinuosa). For example, imagine a triangle with a horizontal side, called 
the base, and refer to the other two sides as the legs. Using his new terminology, 
Viéte made a statement that can be translated as follows [p. 402]:”° 


As the sum of the legs is to their difference, so is the tangent of half the sum 
of the base angles to the tangent of half their difference. 


If the legs are denoted by a and b, if A and B are the angles opposite a and b 
(thus, they are the base angles), and if A is assumed to be larger than B, this 
statement can be rendered in current terminology as 


a+b tan3(A+B) 
a—b  tan}(A—B) 


and is known as the law of tangents. 

Viéte’s success in trigonometry was based on his superior command of 
algebra, the crucial step being his consistent use of consonants for known 
quantities and vowels for the unknowns. His literal expressions, which he 
introduced in the Canon mathematicvs of 1579, were instrumental in freeing 


74 Page references are to Schooten, Francisci Viet opera mathematica, 1646. 
75 The original reads: Vt adgregatum crurum ad differentiam eorundem, ita prosinus 
dimidie summe angulorum ad basin ad prosinum dimidie differentie. 
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algebra from the necessity of considering just specifi examples, of which we 
shall see plenty from earlier times in Chapter 3. 

But the recently mentioned law of tangents was not discovered firs by 
Viéte. It appeared ten years earlier in a book intended as a text: Thome Finkii 
flenspurgensis geometric rotundi [circular geometry] libri XIIII by Thomas 
Fincke (1561-1656), a physician and mathematician from the town of Flens- 
burg, Denmark (now Germany). He stated it in a rather complicated verbal 
form that with the help of a figure ”° if the notation is the same as for Viéte’s 
statement, and if A is assumed to be larger than B, can be rendered into modern 
terminology as follows: 


(a+b) tan +(A + B) 


Va+b)—b  tan[4(A+B)—B] 


Of course, this simplifie to Viéte’s equation. 

But Fincke’s fame does not rest on the law of tangents, which is not even 
included in today’s trigonometry books, since it is easily derived from the law 
of sines, but rather on his having coined the present names for the tangent and 
the secant. He did use the tangent to describe the right-hand side above as sic 
tangens semissis anguli crurum exteriores, ad tangemtem ... (as the tangent 
of the semi[sum] of the angles opposite the legs is to the tangent of ...), but 
he had already introduced them by the statement: Recta sinibus connexa est 
tangens peripheric, aut eam secans (A straight line associated with the sine is 
the tangent of the arc, or also the secant), on page 73 of his book, expressing 
also the hope that these terms be used in this connection. Viéte was opposed to 
them because they could be confused with the same terms used in geometry, 
but history made Fincke the winner. 

The major contributions to trigonometry up to the end of the sixteenth 
century have already been presented, but there is the embarrassing matter of 
terminology. It is clear that, with the exception of the word sinus used by 
Gherardo de Cremona and Regiomontanus, neither the name for the subject 
nor the familiar notation and terminology existed for most of this time. Viéte 
himself did not adopt the use of sinus until 1593, and it was not until the 1590s 
that the words tangens and secans started gaining adherents. 

The reader has had no choice but to have the subject described in this 
chapter referred to as trigonometry, its modern name. This is because this 


76 Both of them can be seen in Zeller, The development of trigonometry from Regiomon- 
tanus to Pitiscus, p. 89. 
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subject remained unnamed almost to the end of the sixteenth century. Finally, 
Bartholomaus Pitiscus (1561-1613) provided a name for it with the publication 
of his book Trigonometria: sive de solvtione triangvlorvm tractatus breuis et 
perspicuus (Trigonometry: or a brief and clear treatise on the solution of 
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Title page and firs page of Pitiscus, Trigonometria, 1595. 
From Eidgenéssische Technische Hochschule Bibliothek, Ziirich. 


triangles) in 1595. It was that brevity and clarity that made the Trigonometria 
a very popular book, to the extent that it was reissued three times in Latin and 
was soon translated into English.’” 

As for the usual abbreviations that are so familiar today, it was Fincke who 
introduced “‘sin.”, “tan.”, and “sec.” in Book 14 of his Geometric rotundi, in 
connection with spherical trigonometry. While these were adopted by Tycho 
Brahe in a manuscript of 1591 and by Pitiscus in the second edition of his 
Trigonometrie of 1600, they were not universally accepted at that time. It 


77 The word “trigonometry” firs appeared in English in 1614 in the translation of the same 
work as Trigonometry, or, the doctrine of triangles by Raph Handson. 
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was not until 1633 that William Oughtred (1574-1660) used “sin” (without 
the period) for sine, in An addition vnto the vse of the instrument called the 
circles of proportion for the working of nauticall questions (an addition to The 
circles of proportion). 

The word cosine was a late arrival in comparison with tangent and secant. 
After all, it was not needed, since it was very easy to avoid inventing cos A by 
referring to sin(90° — A). In this manner, it was called the complement of the 
sine or, as Regiomontanus and Viéte later wrote, sinus complementi. But then, 
from this description, the Englishman Edmund Gunter (1581-1626) coined 
the term co.sinus in his Canon triangulorum of 1620, as well as the term 
co.tangens. They did not gain general acceptance, but John Newton (1622— 
1678), a teacher of mathematics and textbook author, did adopt them, while 
changing co.sinus into cosinus, in his Trigonometria Britanica of 1658. This 
term fared better, and Sir Jonas Moore (1627-1679) in his A mathematical 
compendium of 1674, abbreviated it to “Cos.” and co.tangens to “Cot.”. 

However, there was a reversal of usage during the eighteenth century, 
when three-letter abbreviations ending with a dot became commonplace on 
the Continent. In the main, the English reverted to two-letter abbreviations 
without a dot. It was probably Euler’s use of “sin.”, “‘tan.”, “sec.”, and “cot.” 
in Articles 242 to 249 of his Introductio in analysin infinitorum of 1748 that 
sealed the use of this notation. But Euler used “tang.” and “cosec.”, and on 
the use of abbreviations for these functions there are variations to this day. 


LOGARITHMS 


2.1 NAPIER’S FIRST THREE TABLES 


The mists of sixteenth-century Scotland frequently engulfed Merchiston Tower, 
near Edinburgh. But today Merchiston Tower is in downtown Edinburgh, right 
at the heart of the Merchiston Campus of Napier University. In the sixteenth 


MERCHISTON TOWER IN THE EIGHTEENTH CENTURY 


The lower building with slanted roof and the chimney are eighteenth-century additions. 
From Knott, Napier tercentenary memorial volume, 1915, p. 54. 


E.A. Gonzalez-Velasco, Journey through Mathematics: Creative Episodes in Its History, 78 
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century its seventh laird was Archibald Napier (Neper, Nepier, Nepair, Napeir, 
Nappier, Napper?). John Napier (Nepero in Latin, as we shall see later) was 
born in 1550 to Sir Archibald and his firs wife, Janet Bothwell. At that time, 
Mary Tudor (later Stuart) was queen of the Scots and Edward VI king of the 
English. Three years earlier—to put things in perspective—Henry VIII had 
died, Cervantes was born, and Ivan IV (the Terrible) crowned himself tsar of 
Russia, the firs with that title. 

John Napier, an amateur Calvinist theologian who had studied religion 
at Saint Andrews, predicted that the end of the world would occur in the 
years between 1688 and 1700, in his book A plaine discouery of the whole 
Reuelation of Saint Iohn of 1593. Not a huge success at prediction, it must be 
pointed out. Napier’s reputation may not have reached the present time either 
based on his theological studies or on his activities as a designer of weapons 
of mass destruction, intended for use against the enemies of the true religion. 
His lasting reputation is due to his third hobby: mathematical computation. 
As a landowner, he never held a job, and his time and energy could be entirely 


PossIBLY JOHN NAPIER, COMPUTING 
Portrait by Francesco Delarame, engraving by Robert Cooper (c. 1810). 
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devoted to intellectual pursuits. 

He developed an interest in reducing the labor required by the many tedious 
computations that were necessary in astronomical work, involving operations 
with the very large values of the trigonometric lengths given in the tables of 
his times. His interest in these matters might have been rekindled in 1590 on 
the occasion of a visit by Dr. John Craig, the king’s physician. James VI of 
Scotland had set out to fulfil an important royal duty, to fin a wife, and had 
selected Princess Anne of Denmark. In 1590, on the consent of her brother 
Christian IV, James set sail to Copenhagen to pick her up. Bad weather, 
however, caused his ship to land firs on the island of Hveen near Copenhagen, 
which was the location of Uraniborg, the astronomical observatory of Tycho 
Brahe. 

On his return from Denmark, Dr. John Craig paid the aforementioned visit 
to Napier to inform him of his finding while visiting Tycho Brahe. There were 
new, ingenious ways to perform some of those tedious computations required 
by astronomical calculations, and foremost was the use of prosthapheresis. 
This imposing name, coined from the Greek words for addition (xpoa6éaic) 
and subtraction (dgaipeaic), refers to the use of the trigonometric identity 


sinA-sinB = S[cos(A — B)—cos(A+ B)], 


brought to Hveen by Paul Wittich (c. 1546-1586), an itinerant mathematician 
who visited Uraniborg for four months in 1580.1 So, why should this be all 
the rage in computation? Let us say that we want to evaluate the product of 
two 10-digit numbers a and b. We know that tables of sine and cosine values 
were available at that time, and it was not a difficul matter to fin numbers A 
and B, at least approximately, such that a = sin A and b = sin B (it may be 
necessary to divide or multiply a and b by a power of 10, but this can easily 
be managed). Then perform the simple operations A — B and A + B, fin 
their cosines in the table, subtract them as indicated in the previous formula, 
and divide by 2, and that is the product ab (except for an easy adjustment by a 
power of 10). The big problem of multiplying many-digit numbers was thus 
reduced to the simpler one of addition and subtraction. 

But what about quotients, exponentiations, and roots? Napier sought a 
general method to deal with these computations, eventually found it, and 


1 Its European origin goes back to Johannes Werner, who probably discovered it about 
1510, but a similar formula for the product of cosines is usually credited to the Egyptian 
Abu’ 1-Hasan Ali ibn Abd al-Rahman ibn Ahmad ibn Yunus al-Sadaf al-Misri (c. 950-1009). 
Wittich’s contribution was his realization that it can be used for multiplication of any two 
numbers. See Thoren, The Lord of Uraniborg, 1990, p. 237. 
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gave it to the world in his book Mirifici logarithmorum canonis descriptio 
(Description of the admirable table of logarithms) of 1614 (he had become 
eighth laird of Merchiston in 1608). The English translation’s author’s preface 
starts with a statement of his purpose [p. A2]: 


Seeing there is nothing (right well beloued Students in the Mathematickes) 
that is so troublesome to Mathematicall practise, nor that doth more molest 
and hinder Calculators, then the Multiplications, Diuisions, square and cubi- 
cal Extractions of great numbers, which besides the tedious expence of time, 
are for the most part subject to many slippery errors. I began therefore to 
consider in my minde, by what certaine and ready Art I might remoue those 
hindrances.* 


This is a small volume of 147 pages, 90 of which are devoted to mathe- 
matical tables containing a list of numbers, mysteriously called logarithms, 
whose use would facilitate all kinds of computations. Before the tables, some 
geometrical theorems are given in the Descriptio about their properties, and 
examples are provided of their usefulness. About the use of logarithms we 
shall talk presently, but firs we should say that no explanation was given 
in this book about how they were computed. Instead, there is the following 
disclaimer in an Admonition in Chapter 2 of The first Booke [pp. 9-10]: 


Now by what kinde of account or method of calculating they may be had, it 
should here bee shewed. But because we do here set down the whole Tables, 
..., we make haste to the vse of them: that the vse and profi of the thing 
being firs conceiued, the rest may please the more, being set forth hereafter, 
or else displease the lesse, being buried in silence. 


In short, if his Tables were well received Napier would be happy to explain 
how the logarithms were constructed; otherwise, let them go into oblivion. It 
happened that the Descriptio was a huge editorial success; a book well received 
and frequently used by scientists all over the world. Then an explanatory book 
became necessary but, although it was probably written before the Descriptio, 
it was published only posthumously (Napier died in 1617, one year after 
Shakespeare and Cervantes) under the title Mirifici logarithmorvm canonis 
constrvctio, when his son Robert included it with the 1619 edition of the 
Descriptio? 


? From Wright’s translation as A description of the admirable table of logarithmes (“all 
perused and approued by the Author”), 1616. Page references are to this edition. 

3 Page references are given by article number so that any of the available sources (see 
the bibliography) can be used to locate them. Quotations are from Macdonald’s translation. 
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Canonis defcriptio, 
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From Knott, Napier tercentenary memorial volume, Plate IX facing page 181. 


The Constrvctio gives a glimpse into Napier’s mind and his possible 
sources of inspiration. It starts as follows: 


1. A logarithmic table [tabula artificialis] is a small table by the use of 
which we can obtain a knowledge of all geometrical dimensions and motions 
in space, by a very easy calculation. 


. very easy, because by it all multiplications, divisions and the 
more difficul extractions of roots are avoided ... 


It is picked out from numbers progressing in continuous proportion. 
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2. Of continuous progressions, an arithmetical is one which proceeds by 
equal intervals; a geometrical, one which advances by unequal and propor- 
tionately increasing or decreasing intervals. 


It will be easier to interpret what Napier may have had in mind using modern 
notation. To us, a geometric progression is one of the form a, ar, ar’,ar?,..., 
where a and r are numbers, and it is clear that the exponents of r form an 
arithmetic progression with step 1. It was common knowledge in Napier’s 
time that the product or quotient of two terms of a geometric progression 
contains a power of r whose exponent is the sum or difference of the original 
exponents.* 

To take a specifi instance, Michael Stifel (1487-1567) had published a 
list of powers of 2 and their exponents in his Arithmetica integra of 1544, 
with the exponents in the top row (in superiore ordine) and the powers in the 


oO» fe 2e Ze 4 5° 6. 7 Se 
te 2 fe Se G4 320 64, 128. 256, 


bottom row (in inferiore ordine), and right below he taught his readers how to 
use the table to multiply and divide (translated from Liber III, p. 237): 


As the addition (in the top row) of 3 and 5 makes 8, thus (in the bottom 
row) the multiplication of 8 and 32 makes 256. Also 3 is the exponent of 
eight itself, & 5 is the exponent of the number 32. & 8 is the exponent of 
the number 256. Similarly just in the top row, from of the subtraction of 3 
from 7, remains 4, thus in the bottom row the division of 128 by 8, makes 16. 


That is, to multiply 8 = 23 and 32 = 2° just add the exponents to obtain 
28 = 256, and to divide 128 = 2’ by 8 = 2? just subtract the exponents 
to obtain 24 = 16. Thus multiplication and division are easily reduced to 
additions and subtractions of exponents. 


4 When writing r” I am using current notation, which did not exist in Napier’s time, for 
it was introduced by Descartes in La Géométrie, 1637. On page 299 of the original edition 
(as an appendix to the Discours de la méthode, La Géométrie starts on page 297) Descartes 
stated: Et aa, ou a?, pour multiplier a par soy mesme; Et a>, pour le multiplier encore vne 
fois par a, & ainsi a V’infini (And aa, or a*, to multiply a by itself, And a+, to multiply 
it once more by a, and so on to infinity) Quoted from Smith and Latham, The geometry 
of René Descartes, 1954, p. 7. This statement can also be seen, in the English translation 
by Smith and Latham only, in Hawking, ed., God created the integers, The mathematical 
breakthroughs that changed history, 2005, p. 293. 
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But what about 183/11? There are no such table entries, not even approx- 
imately. The trouble is that the values of 2” are very far apart. For values 
of r” to be close to each other, r must be close to 1. Napier realized at this 
point that the computations of powers of such an r may not be easy (think of, 
say, r = 1.00193786), expressing himself as follows in the Constrvctio: 


13. The construction of every arithmetical progression is easy; not so, 
however, of every geometrical progression. ... Those geometrical progres- 
sions alone are carried on easily which arise by subtraction of an easy part 
of the number from the whole number. 


What Napier meant is that if r is cleverly chosen, multiplication by r can be 
reduced to subtraction. In fact, he proposed—in his own words in Article 14— 
to choose r of the form r = 1 — 10~*. Then multiplication of a term of the 
progression by r to obtain the next one is equivalent to a simple subtraction: 


art! = ar"(1— 10-*) =ar" — aro, 


and the last term on the right is easily obtained by a shifting of the decimal 
point. With such an r, it is not necessary to evaluate its successive powers by 
multiplication to obtain the terms of the progression. 

On this basis Napier made his choice of a and r. First he chose a = 
10,000,000 because at this point in his life he was mainly interested in com- 
puting with sines, and the best tables at that time (c. 1590), those of Regiomon- 
tanus and Rhaeticus, took the whole sine (sinus totus) to be 10,000,000. Then, 
in order for the terms of his geometric progression to be sines (that is, parts 
of the whole sine), he had to choose r < 1; and for ease of computation he 
selected 

r = 1 — 0.0000001. 


We have talked about shifting the decimal point as a simple operation. 
Was such a procedure, or even the decimal point, available in Napier’s time? 
The use of decimal fractions can be found in China, the Islamic world, and 
Renaissance Europe. It eventually replaced the use of sexagesimal fractions, 
then in vogue, after Simon Stevin (1548-1620), of Bruges, explained the 
system clearly in his book in Flemish De thiende (“The tenth”) of 1585. Their 
use had already been urged in the strongest terms by Francois Viéte in his 
Vniversalivm inspectionvm of 1579 [p. 17]: 


Finally, sexagesimals & sixties are to be used sparingly or not at all in Mathe- 
matics, but thousandths & thousands, hundredths & hundreds, tenths & tens, 
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and their true families in Arithmetic, increasing & decreasing, are to be used 
frequently or always.> 


Not only was Napier aware of decimal fractions and points, but, through 
his continued use of decimal notation in the Constrvctio, he may have been 
instrumental in making this notation popular in the world of mathematics. He 
explained it clearly at the beginning of the Constrvctio: 


5. In numbers distinguished thus by a period in their midst, whatever is 
written after the period is a fraction, the denominator of which is unity with 
as many ciphers [zeros] after it as there are figure after the period. 


Thus 10000000-04 is the same as 10000000 ; also 25-803 is the 


same as 25 5°; also 9999998-0005021 is the same as 9999998 071 


and so of others. 


With his use of decimal notation thus clearly settled and his choice of a 
and r, Napier then continued his construction in Article 16 as follows: 


Thus from the radius, with seven ciphers added for greater accuracy [that 
is, with seven zeros after the decimal point], namely, 10000000.0000000, 
subtract 1.0000000, you get 9999999.0000000; from this subtract .9999999, 
you get 9999998.0000001; and proceed in this way until you create a hun- 
dred proportionals, the last of which, if you have computed rightly, will be 
9999900.0004950. 


To sum up in present-day notation, Napier had applied the formula 


107r"*! = 107r"(1 — 1077) = 107r” —r” 


5 But Viete did not use the decimal point. Instead, he wrote the decimal part of a number 
as a fraction with an empty denominator, giving the semicircumference of a circle whose 
sinus totus is 100,000 as 314,159, 26 [p. 15]. Viéte used this notation through most of 
the book, but later he modifie it, using firs a vertical bar to separate the decimal part, 
which was also written in smaller or lighter type. For instance, he gave the sinus of 60° as 
86,602|540,37 [p. 64]. Later, he replaced the vertical bar with another comma, writing now 
the semicircumference of the given circle as 314,159,265,36 [p. 69]. 
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with n = 0,..., 100 to perform the following computations:® 


10000000 x 0.9999999° = 10000000 
10000000 x 0.9999999! = 9999999 
10000000 x 0.9999999? = 9999998 .0000001 
10000000 x 0.99999993 = 9999997.0000003 


10000000 x 0.9999999!"° — 9999900.0004950. 


This completes his First table, which, writing in Latin, he called “canon.” Of 
course, the entries on the right-hand sides of the fourth line and those below 
are only approximations, as shown by the fact that we have only subtracted 
0.9999998 from the third entry to obtain the fourth, but Napier had already 
remarked in Article 6: When the tables are computed, the fractions following 
the period may then be rejected without any sensible error. 

This firs table is just the start of Napier’s work, and these entries do not 
even appear in the Descriptio. The use of this table in computation is limited in 
at least two ways (we will mention a third later on). The firs is that it contains 
very few entries, and the second is that all the numbers on the right-hand sides 
are very close to 10,000,000 due to the choice of r. But these problems are 
easy to deal with by computing more entries and choosing a different value 
of r. 

Thus Napier started another table, but this time, to move a little further 
from 10,000,000, he chose r to be the last number in the firs canon (omitting 
the decimals, for dealing with such large numbers one need not trouble with 
them) over the firs one, 


9999900 1 
=> —$— > (e = 1 — ’ 
” * 10000000 aa 100000 


8 At the risk of delving into tedium by harping on mathematical notation, the x sign 
for multiplication did not exist in Napier’s time. It was introduced for the firs time in an 
anonymous appendix to the 1618 edition of Wright’s translation of the Descriptio entitled 
“An appendix to the logarithmes,” pp. 1-16, probably written by William Oughtred. It is 
reprinted in Glaisher, “The earliest use of the radix method for calculating logarithms, with 
historical notices relating to the contributions of Oughtred and others to mathematical no- 
tation,’ 1915. It also appears in Oughtred, Clavis mathematic (The key to mathematics), 
1631. However, the = sign that we use below did exist; it had been introduced by Robert 
Recorde (1510-1558) in his algebra book The whetstone of witte of 1557. The original sym- 
bol was longer than the present one, representing two parallel lines “bicause noe. 2. thynges, 
can be moare equalle.” This book has unnumbered pages, but is divided into named parts. 
This quotation is from the third page of “The rule of equation, commonly called Algebers 
Rule.” This is page 222 of the text proper, not counting the front matter. 
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and computed 51 entries by the same method as before, but now using the 
equation 107r”+! = 107r” — 100r” to obtain 


10000000 x 0.99999° = 10000000 
10000000 x 0.99999! = 9999900 


10000000 x 0.99999°° = 9995001.222927 


(he erred: the last entry should be 9995001.224804). This is his second table. 
Then he started again with the ratio 


9995000 1 
"= Too00000 ~ 2??? 2000 


and computed 21 entries: 


10000000 x 0.9995° = 10000000 


10000000 x 0.999579 = 9900473.57808. 


He placed the numbers so obtained in the firs column of his third canon. 
Finally, from each of these values he computed 68 additional entries using the 
ratio 
9900473 
0.99 = 


~ 10000000. 


and these he located horizontally from each firs value. Thus, his third table had 
69 columns, and the entries in each column are 0.99 of those in the preceding 
column. The last entry of the last column is 


9900473.57808 x 0.99% ~ 4998609.4034, 


“roughly half the original number” (again, the last two decimal digits are in 
error). 

From Napier’s point of view, interested as he was at the time in computa- 
tions with sines and applications to astronomy, this was enough, since—other 
than the magnificatio by 10,000,000—the right-hand sides above are the 
sines of angles from 90° to 30°. He would easily deal with smaller angles in 
Articles 51 and 52 of the Constrvctio. 
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First Column. 


Second Column. 


69th Column. 


10000000.0000 9900000.0000 5048858.8900 
9995000.0000 9895050.0000 5046334.4605 
9990002.5000 9890102.4750 5043811.2932 
9985007.4987 9885157.4237 5041289.3879 
9980014.9950 9880214.8451 5038768.7435 


9900473.5780 


9801468.8423 


4998609.4034 


Now let us take stock of what Napier had and had not achieved to this 
point. He had elaborated several tables and subtables based on the idea of 
using the arithmetic progressions of what we now call exponents to perform 
computations with the table numbers. These contain about 1600 entries, not 
quite a sufficien number; and, what is more, the various tables are based on 
different choices ofr, so that the product or quotient of numbers from different 
tables is impossible. Furthermore, there is a third problem in that his table 
entries are laden with decimals, while the sines in the tables of his time were 
all whole numbers (for simplicity, we shall refer to a sinus as a sine from now 
on). Something else had to be done. 
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What Napier needed is the reverse of what he had. In his firs three tables the 
exponents are integers but the sines are laden with decimals. What he needed 
was to start with sines that are whole numbers, available in published tables, 
and then compute the corresponding exponents whatever they may be. 

His next idea was based on a graphical representation, and once polished 
he gave it to us as follows in the Constrvctio. First he chose a segment ST, of 
length 10000000, to represent the Sinus Totus [Art. 24], and let 1, 2, 3, and 4 
be points located in the segment TS as shown. Assume that TS, 1S, 25, 3S, 
4S,... are sines in continued proportion 


1S 2S 3S 48 
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such as those of the First table, in which case the stated ratio is 0.9999999, 
15 = 9999999, and so on. Then he envisioned a point g that, starting at T 
with a given initial velocity, proceeds toward S with decreasing velocity, in 
such a manner that the moving point [Descriptio, p. 2] 


in equall times, cutteth off parts continually of the same proportion to the 
lines [segments] from which they are cut off. 


This means, referring to the figur above and noting that subtracting each of 
the preceding fractions from | and simplifying gives 


Tl 12 23 34 


TS 1S 2S 3S 


that g traverses each of the segments T1, 12, 23, 34, ... in equal times. 
Napier referred to the described motion of g as geometrical (hence his use of 
the letter g). Differential equations were not known in Napier’s time, and he 
could not explain this motion as clearly as we might wish. 

Assume next that 15, 2S, 35,45, ... are the sines in the First table, and 
imagine an infinit half-line bi (drawn in the next figure) on which we choose 
coordinates 0, 1, 2, 3, 4, ... as follows. First, the origin is at b, and then 
let a new point a move on this line from d to the right with constant velocity, 
that of g when at T. We choose the coordinates 1, 2,3, 4,... (not shown in 
Napier’s figure to be the points on bi at which a arrives when g arrives at the 
points with labels 1, 2, 3,4, .... Thus, as g passes through the points marking 
the sines of the First table in equal amounts of time, a reaches the exponents 
1,2,3,4,... that generate these sines. 

Napier chose to introduce this second line in a general situation, consid- 
ering the case of an arbitrary sine as follows [Art. 26]: 


Let the line TS be radius, and dS a given sine in the same line: let g move 
geometrically from T to d in certain determinate moments of time. Again, 
let bi be another line, infinit towards 7, along which, from b, let a move 
arithmetically [constant velocity] with the same velocity as g had at firs 
when at T: and from the fi ed point b in the direction of 7 let a advance in 
just the same moments of time [as it took g to advance to d] to the point c. 
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Then, in general, if the coordinates on the line bi are as we have just 
described, if dS is an arbitrary sine, if g and a start with the same initial 
velocity, and if a arrives at a point c in the same amount of time as g arrives 
at d, the distance bc represents the exponent—not necessarily an integer— 
corresponding to the sine dS. 

In the Constrvctio [Article 26], Napier called the distance bc the “artificia 
number” of the “natural number” dS, but he later changed his terminology. 
Since the sines in Napier’s basic tables are in continued proportion or ratio, he 
later coined the word logarithm, from the Greek words A6yav (logon) = ratio 
and dpi0udc (arithmos) = number, to refer to each of these exponents. In the 
Descriptio, written a few years after the Constrvctio, he gave the definitio as 
follows [pp. 4—5]: 


The Logarithme therefore of any sine is anumber very neerely expressing the 
line, which increased equally in the meane time, while the line of the whole 
sine decreased proportionally into that sine, both motions being equal-timed, 
and the beginning equally swift. 


By the time the translation of the Constrvctio was prepared, the word logarithm 
was already in common use, and that motivated the translator to use it there. 
We shall do so from now on. 

At this point, Napier faced the task of computing the logarithms of sines 
given by whole numbers. The start is easy: 


27. Whence nothing [zero] is the logarithm of radius. 


Quite clearly: if g and a do not move they remain at T and b, respectively. 
But the rest, the exact computation of logarithms of whole numbers, was far 
from trivial with the tools available to Napier (the already mentioned lack 
of differential equations was responsible). On the face of that impossibility, 
he made a giant leap by realizing that it may be good enough to fin lower 
and upper bounds for such logarithms. With a certain amount of ingenuity he 
found such bounds. 
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28. Whence also it follows that the logarithm of any given sine is greater 
than the difference between radius and the given sine, and less than the 
difference between radius and the quantity which exceeds it in the ratio of 
radius to the given sine. And these differences are therefore called the limits 
of the logarithm. 


This can be explained, and then his proof abbreviated, as follows. Let dS be 
the given sine and let oS be the quantity that exceeds the radius in the ratio 
os TS 
TS dS 
According to the geometric motion of g and the arithmetical motion ofa, “oT, 
Td, and bc are distances traversed in equal times.” Then, since the velocities 


oO T d S 


of g and a are equal when g is at T and a at b, and since the velocity of g is 
decreasing but that of a is constant, 


oS — TS =oT > bc (= logdS) > Td = TS —ds5, 


which was to be proved. 

It goes without saying that a present-day reader would like a restatement 
of Napier’s result in current notation.’ That is not difficul if we denote the 
given sine dS by x. Then oS = TS?/x, and we can write the inequalities of 
Article 28 in the form 


TS? _—_ TS(TS — x) 


Xx Xx 


TS — x < logx < 


to which we have appended an equality from Article 29. It was this result that 
enabled Napier to fin the logarithms of many sines, starting with those in the 


7 Napier stated all of his results in narrative form. The Constrvctio does not contain a 
single modern-looking formula, nor is any notation used for “artificia number” or logarithm. 
I have, however, adopted log as a natural choice, since there is no possibility of confusing 
this with any current mathematical function at this point. 
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First table (they are not all whole numbers, but their logarithms will be useful 
nevertheless). Starting with x = 9999999 we have TS — x = 1.0000000 and 


TS(TS—x) _ 10000000 
x ~ 9999999 


= 1.00000010000001.8 


Then, since these limits differ insensibly, Napier took [Art. 31] 
log 9999999 = 1.0000000S. 


Now, if we denote once more the sines in the First table—that is, the 
numbers on the right of the equal signs—by TS, 1S, 2S, 3S, ..., we have 
already seen that as g goes through them in equal times, their logarithms 
follow the motion of a and we have [Art. 33] 


log2S =2log1S, log3S =3log1S, 
and so on. In particular, for the last sine in the First table we have 
log 9999900.0004950 = 100 log 15 = 100 log 9999999, 
and then, from the bounds on log 9999999, 
100 < log 9999900.0004950 < 100.00001000. 


We can pick log 9999900.0004950 = 100.000005. 

This method works well because the sines in the First table are very close 
to 10000000. But the theorem in Article 28 is not sufficient! accurate further 
down the line. For instance, if x = 9000000, which is still a lot closer to 
10000000 than to zero, it would give 


10000000(10000000 — 9000000) 
9000000 


That is, 1000000 < log 9000000 < 1111111, which is too wide a range to 
select log 9000000 with any accuracy. 
Thus Napier needed a better theorem, and actually found two. The firs is: 


10000000 — 9000000 < log 9000000 < 


36. The logarithms of similarly proportioned sines are equidifferent. 


8 A reader who is concemed that the application of this formula requires long division is 
invited to count the number of such divisions that will be necessary to elaborate Napier’s 
complete table of logarithms. The total count will turn out to be extremely small. 
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This means that if v, x, y, and z are sines such that 


then 
log x — loguv = logz — logy. 


Napier stated [Art. 32]: “This necessarily follows from the definition of a 
logarithm and of the two motions.” This is because g traverses the distance 
x — v in the same amount of time as z — y, and then a moves from log x to 
log v in the same amount of time as from log z to log y. 

His second theorem is a little more involved: 


39. The difference of the logarithms of two sines lies between two limits; 
the greater limit being to radius as the difference of the sines to the less sine, 
and the less limit being to radius as the difference of the sines to the greater 
sine. 


After providing a geometric proof similar to that in Article 28 but a little longer 
and requiring Article 36, he rephrased this result in a verbal form [Art. 40] 
that can be rewritten in today’s terminology as follows. If x and y are the two 
sines and y is the larger, then 


TS(y — x) TS(y — x) 


< logx — logy < 


Then he immediately put this result to good use by showing how to compute 
the logarithm of any number that is not a sine in the First table, but near one 
of them [Art. 41]. 

For example, let x = 9999900. In the First table the nearest sine is y = 
9999900.0004950 = 10000000 x 0.9999999! and, as shown in Article 33 
(page 92), 

100 < log y < 100.0000100. 


Then, with x and y as above, we have TS(y — x) = 4950, and the inequali- 
ties in Article 40 (stated here after quoting Article 39) give the approximate 
inequalities 


0.00049500495002 < log x — logy < 0.00049500495005. 


Napier did not state these inequalities, but only concluded that we can take 
log x — log y = 0.0004950, and then 


log x = log y + 0.0004950. 
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Hence, from the bounds for log y, 
100.0004950 < logx < 100.0005050. 


But x = 9999900 appears in the Second table as 10000000 x 0.99999!, 
and the sines in this table are in continued proportion, just as TS, 1S, 2S, 
3S,... are in Napier’s general formulation. As g goes through them in equal 
times their logarithms are found from the motion of a in the same equal times. 
They are double, triple, etc., the logarithm of x. Thus, from the limits found 
above for log x, the limits for the logarithms of the sines of the Second table are 
200.0009900 and 200.0010100 for the second, 300.0014850 and 300.0015150 
for the third, and so on until the limits for the logarithm of the last sine (the 
50th after the first are found to be 


5000.0247500 < log 9995001 .222927 < 5000.0252500. 


The actual logarithms of these sines can be chosen between the limits stated 
here [Art. 42]. Thus, the logarithms of the sines of the Second table have been 
quickly obtained by a cascading procedure identical to that giving those of the 
sines in the First table. 

Next, in [Art. 43], “to fin the logarithms of... natural numbers ... not 
in the Second table, but near or between” the sines in this table, Napier used 
a new method. As above, start by naming a given sine and the one nearest to 
it in the Second table by x and y, with y > x. But now choose what Napier 
called a fourth proportional, that is, a number z such that (in our terminology) 

Zz X 
TS sy 
which is possible because the other three numbers are known. This places z 
within the bounds of the First table because x ~ y implies that z ~ TS, and 
then the logarithm of z can be computed by the previous method. Then, the 
theorem in Article 36 and the fact that log TS = 0 give 


logz = logx — logy. 


In this equation, two of the logarithms are known and the third can then be 
found. Or, more cautiously, the limits for two of these logarithms can be 
determined, whence the limits for the third can be ascertained. 

Napier applied this procedure to obtain the logarithm of x = 9995000, 
which is near the last entry, y = 9995001.222927, of the Second table. We 


SECTION 2.2 NAPIER’S LOGARITHMS 95 


have already found that 
5000.0247500 < log y < 5000.0252500, 


and the fourth proportional is 


9995000 
= TS ——-—_ = 9999998.7764614. 
2 = TS 9995001.222027 


Then, replacing x with z and y with 9999999 in the limits of Articles 39 and 


40, yields 
T S(9999999 — z) 


= 0,2235386 
9999999 
ae T $(9999999 — z) 
““ = 02235386. 
Zz 


It follows that 
log z — log 9999999 = 0.2235386. 


Since we know that 1 < log 9999999 < 1.0000001, we obtain 
1.2235386 < logz < 1.2235387. 


The equation log x = log y + log z and the limits found for the two logarithms 
on the right-hand side give (these are Napier’s figures 


5001.2482886 < logx < 5001.2487888. 
From these limits Napier concluded as follows: 


Whence the number 5001.2485387, midway between them, is (by 31) taken 
most suitably, and with no sensible error, for the actual logarithm of the given 
sine 9995000. 


Unfortunately, his result is a little off because the last entry of the Second 
table is in error, as mentioned above. On the good side of things, this got 
him into the Third table, whose second entry is precisely 9995000. Then the 
logarithms of the firs two entries in the firs column of the Third table are 
known, and the result in Article 36 allowed him to cascade down, computing 
the logarithms of all the entries in this column [Art. 44]. Then the logarithms 
of numbers that are near or between the entries of this column can be computed 
by the method of the fourth proportional illustrated above. Using this method, 
he computed the logarithm of 9900000, which is near the last entry of the firs 
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column of the Third table [Art. 45]. But this is the firs entry in the second 
column. Now denote the firs two sines in the firs column by x and y and the 
firs two sines in the second column by u and v. We know that 


u 


Xx 


and from this that ‘i 


u 


ael< <“[e 


By Article 36, log v — logu = log y — log x, and, since the last difference 
is known, log v can be computed. Cascading down, the logarithms of all the 
sines in this second column can be found. Finally, once the logarithms of 
the numbers in the firs two columns were obtained, the fact that the entries 
in each row of the Third table are in continued proportion means that their 
logarithms are equidifferent. This allowed Napier to compute the logarithms 
of all the entries in the Third table [Art. 46]. 

Having completed this second stage of his work, Napier now discarded 
the First and Second tables, keeping only the Third table with the logarithms 
of all its entries. This—the framework for the construction of the complete 
logarithmic table—he called The Radical Table [Art. 47]. A portion of it is 
shown on the next page. He considered it sufficien to leave just one decimal 
digit in the logarithms (Artificiales), while he had used seven in all preliminary 
computations to avoid error accumulation. It is not important to us that these 
logarithms are slightly erroneous due to the faulty last entry in the Second 
table (the last one of the 69th column should be 6934253.4), because we are 
interested in Napier’s invention and how he constructed the table, but not in 
using it. 

Then Napier embarked on the third stage of his work: the elaboration of 
a table of logarithms, which he called the principal table, for the sines of 
angles from 45° to 90° in steps of | minute. These can be obtained from 
any table of common sines. For each sine he found firs the entry in the 
Radical table that is closest to it. But now that seven-decimal-digit accuracy 
was not needed, the method of the fourth proportional becomes irrelevant, 
and even the use of the inequalities in Article 38. What counted at this stage 
was speed, so that the principal table can actually be constructed. So, given 
a sine and its closest entry in the Radical table, x and y with y > x, compute 
TS(y — x), and then, instead of dividing this product firs by y and then by x 
to fin the limits of log x — log y, just divide it by the easiest possible number 
between x and y. Then the result is an approximation of the true value of 
log x — log y, and the unknown logarithm can be approximately found from it. 
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Reproduced from page 25 of the 1620 edition of the Constrvctio. 


For example [Art. 50], if y = 7071068, the nearest sine in the Radical table is 
x = 7070084.4434 , and then TS(y — x) = 9835566000. Dividing this result 
by the easiest number between x and y, which Napier took to be 7071000, 
“there comes out” 1390.9. Thus, taking log x from the Radical table, gives us 


log y © log x — 1390.9 = 3467125.4 — 1390.9 = 3465734.5. 


“Wherefore 3465735 is assigned for the required logarithm of the given sine 
7071068.” 

Allit took from this point on was time and effort, and Napier was eventually 
able to compute the logarithms of the sines of all angles between 45° and 90° 
in steps of | minute. All of this by dipping his quill in ink and partly by 
candlelight, long division after long division. 
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The fina stage would be to deal with the angles between 0° and 45°, since 
we know that what really matters is what happens in the firs quadrant. To do 
this, Napier proposed two methods, both of which are based on the theorem 
in Article 36. 

First, if we use the equations after the quotation of this theorem with 
x = 2v, y = 5000000, and z = TS, we have 


log 2v — log v = log 10000000 — log 5000000, 


and taking the last logarithm from the previously computed part of the principal 
table, but before rounding it as an integer, yields 


log v = log2v + log 5000000 = log 2u + 6931469.22. 
Or as Napier put it [Art. 51], 


Therefore, also, 6931469.22 will be the difference of all logarithms whose 
sines are in the proportion of two to one. Consequently the double of it, 
namely 13862938.44, will be the difference of all logarithms whose sines are 
in the ratio of four to one; and the triple of it, namely 20794407.66, will be 
the difference of all logarithms whose sines are in the ratio of eight to one. 


Not content with this, he showed, in a similar manner, that 


52. All sines in the proportion of ten to one have 23025842.34 for the 
difference of their logarithms. 


That this figur should have been 23025850.93 is neither here nor there. What 
matters is that he could now fin the logarithms of the sines of angles below 
45°. To quote his own statement [Art. 54]: 


This is easily done by multiplying the given sine by 2, 4, 8, 10, 20, 40, 80, 
100, 200, or any other proportional number you please ... until you obtain 


a number within the limits of the Radical table. 


The second method is based on a trigonometric identity that he expressed 
as follows: 


55. As half radius is to the sine of half a given arc, so is the sine of the 
complement of the half arc to the sine of the whole arc. 


He provided an ingenious geometric proof of this result, which, if we keep in 
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mind that his sine is the product of the radius R and our sine, can be rewritten 
as follows: 


R/j2 ‘sin (90° — 3a) 


Rsin Sa Rsina 


Today we would write this result in the familiar form sina = 2 sin Sa cos 5a. 
Then he used the theorem in Article 36 again to obtain in verbal form a result 


that can be rewritten as 
log : — log (R sin 3) = log [R sin (90° = 32) — log (R sin a): 


Then, if 22.5° < Sa < 45°, the two logarithms on the right-hand side as well 
as that of R/2 are in the principal table, and the remaining logarithm can be 
determined. Napier concluded [Art. 58]: 


From these, again, may be had in like manner the logarithms of arcs down 
to 11 degrees 15 minutes. And from these the logarithms of arcs down to 5 
degrees 38 minutes. And so on, successively, down to 1 minute. 


And now there is nothing left but the actual construction of the logarithmic 
table. Napier gave precise instructions about how to do it step by step, starting 
with the following advice [Art. 59]: 


Prepare forty-fi e pages, somewhat long in shape, so that besides margins at 
the top and bottom, they may hold sixty lines of figures 


A page of the resulting table is reproduced on the next page. The publisher 
did not seem to have pages that could accommodate 60 lines of figures so he 
put two columns of 30 lines on each page. The firs column of this sample 
page contains the angles from 28° 0/ to 28° 30’, the second column lists their 
sines (probably taken from Erasmus Rheinhold’s table of sines, which Napier 
mentioned in particular), and the third their logarithms, computed as explained 
in the preceding articles. The rest of the angles corresponding to 28° would be 
on the next page, not shown. On arrival at the page containing the sines and 
logarithms for angles from 44° 30’ to 44° 60’ no new pages were used. Instead, 
the table continued on the right of that last page with angles increasing from 
45° 0’ at the bottom of the seventh column to 45° 30’ at the top. The angles 
located on both ends of each horizontal line are complementary. Their sines 
and logarithms are listed in the sixth and fift columns, respectively. Then the 
table continued in the same manner on the preceding page and then on those 
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Reproduced from Wright’s translation of the Descriptio. 


previously used, as shown on the sample page. The central column contains 
the differences between the logarithm on its left and that on its right. These 
are the logarithms of the sine of a given angle and of its cosine (which is equal 
to the sine of the complement), and it is useful in evaluating the logarithms of 
tangents when viewed as quotients of sine and cosine. 

It must be said for the sake of fairness that the earliest discoverer of log- 
arithms was Joost, or Jobst, Biirgi (1552-1632), a Swiss clockmaker, about 
1588. He seems to have been inspired by Stifel’s table, but he chose 1.0001 
as his starting number close to 1. Then, if N = 100000000 x 1.0001“, Biirgi 
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called 10L, not L, the “red number” (Rote Zahl) corresponding to the “black 
number” (Schwarze Zahl) N. These are the colors in which these numbers 
were printed in his 1620 tables Aritmetische vnd geometrische Progress Tab- 
ulen. Thus, he lost publication priority to Napier. 

Biirgi was in Prague as an assistant to the astronomer Johannes Kepler 
(1571-1630). Kepler had become an assistant to Tycho Brahe in 1598, and in 
1599 Brahe moved to Prague, attracted by an offer from the emperor Rudolf II 
(who reigned over the Sacrum Romanum Imperium in the period 1576-1612) 
and Kepler followed the year after. On Brahe’s death in 1601, Kepler became 
the officia astronomer and went on to discover the three laws of planetary 
motion. His work, based on Brahe’s astronomical observations, was assisted 
by the use of logarithms, and this helped spread their use in Europe. Biirgi 
remained an unknown not just because of questions of priority, but because 
most copies of his tables were lost during the Thirty Years’ War. The crucial 
Battle of the White Mountain, in which 7,000 men lost their lives, was fought 
just outside Prague, in November 1620. 


2.3 BRIGGS’ LOGARITHMS 


The firs mathematics professorship in England, a chair in geometry, was 
endowed by Sir Thomas Gresham in 1596 at Gresham College in London, and 
Henry Briggs (1561-1631) was its firs occupant. He was quite impressed by 
the invention of logarithms, and on March 10, 1615, he wrote to his friend 
James Usher: 


Napper, Lord of Markinston, hath set my Head and Hands a Work, with 
his new and admirable Logarithms. I hope to see him this Summer if it 
please God, for I never saw Book that pleased me better, and made me more 
wonder.2 


The astrologer William Lilly (1602-1681) wrote his autobiography in 1667— 
1668 in the form of a letter to a friend. It contains an interesting—but rather 
fantastic—account of Briggs’ arrival at Merchiston, stating that on meeting 
Napier, “almost one quarter of an hour was spent, each beholding the other 
almost with admiration, before one word was spoke.”"° 


9 Quoted from Parr, The life of the Most Reverend Father in God, James Usher, 1686, 
p. 36 of the collection of three hundred letters appended at the end of the volume. 
ie Lilly finishe his book with an account of the firs encounter between Napier and Briggs, 
pp. 235—238. This quotation is from p. 236. The entire account of Briggs’ arrival is reprinted 
(almost faithfully) in Bell, Men of mathematics, 1937, p. 526. 
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Having overcome this mutual admiration hiatus, they set down to work. 
For this there was ample time, since Briggs remained at Merchiston for about 
a month. Both men had become aware of some serious shortcomings in the 
logarithm scheme published by Napier. For instance, if x = yz, then 


aS 
1 
and, according to Article 36, log x — log y = logz — log 1, or 


log yz = log y + logz — log 1. 


This is as true today as it was then, but today log 1 = 0. In Napier’s scheme, it 
was the logarithm of the sinus totus that was zero. The logarithm of 1, anumber 
very close to S in the top segment TS on page 90, is an enormous number on 
the bottom line. It was computed by Napier to be 161180896.38 (while this is 
not entirely correct, because of the errors already mentioned, the true value in 
his own system is very close to it). Thus, the Neperian logarithm of a product 
is not the sum of the logarithms of the factors, and the Neperian logarithm of 1 
must be remembered every time the logarithm of a product is to be computed. 
Similarly for a quotient: if x = y/z, then log x — log 1 = log y — log z and 


log ~ = logy — logz + log 1. 


The second difficult in using Napier’s logarithms is in findin the logarithm 
of ten times a number whose logarithm is known. If that number is called x, 
the obvious equation 

10x | 10’ 

x 106 
leads to 
log 10x — logx = log 10’ — log 10°, 
and, since the logarithm of the sinus totus is zero, we obtain 
log 10x = log x — log 10°. 


What makes the computation of log 10x in this manner inconvenient is the fact 
that log 10° = 23025842. 

Both Napier and Briggs had been separately thinking of ways to solve 
these deficiencies and each of them made some proposals to accomplish that. 
These involved, in either case, the construction of a new set of logarithms 
from scratch. But they differed about the key properties on which the new 
logarithms should be based. Here is Briggs’ proposal (see below for reference): 
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When I explained their [the logarithms’] doctrine publicly in London to my 
auditors at Gresham College; I noticed that it would be very convenient in the 
future, if 0 were kept for the Logarithm of the whole Sine (as in the Canone 
Mirifico) while the Logarithm of the tenth part of the same whole Sine ... 
were 1,00000,00000 ... 


In short, Briggs proposed and started the construction ofa new set of logarithms 
based on the values log 10’ = 0 and log 10° = 10!°. This certainly simplifie 
the computation of log 10x from log x. 

Briggs wrote to Napier with this proposal, and later he journeyed to Edin- 
burgh, where they met for the firs time. It turns out that Napier had also been 
thinking about the change in the logarithms, and Briggs described Napier’s 
counterproposal as follows: 


But [he] recommended to therefore make this change, that 0 should be the 
Logarithm of Unity, & 1,00000,00000 [that of] the whole Sine: which I could 
not but acknowledge was most convenient by far. 


Indeed, with log1 = 0, it follows from the preceding discussion that the 
logarithm of a product or quotient becomes the sum or difference of the loga- 
rithms. So Briggs put aside the logarithms that he had already computed, and 
in the following summer he again journeyed to Edinburgh and showed Napier 
the new recomputed logarithms. They appeared firs in a small pamphlet 
of 16 pages, Logarithmorum chilias prima (The firs thousand logarithms), 
whose preface is dated 1617, the year of Napier’s death. In 1624 he published 
Arithmetica logarithmica sive logarithmorvm chiliades triginta (Logarithmic 
arithmetic or thirty thousands of logarithms), containing the new logarithms 
of the integers 1 to 20,000 and 90,000 to 100,000 (the reason for the large gap 
in the table will be given later). The quotations translated above are from the 
Preface to the Reader in this book. 

However, we should not create the impression that he elaborated the new 
logarithms on the basis of Napier’s proposal of the summer of 1615: that 
log 1 = 0 and log 10’ = 10!°. The firs of these equations he fully accepted 
[p. 2; 2-1],"1 for it makes the logarithm of a product or quotient the sum or 
difference of their logarithms [pp. 2,3; 2-2, 2-3].'*. Then it is easy to see 


11 References in square brackets are to the Arithmetica logarithmica. The firs page 
reference is to the original in Latin; the second, after the semicolon, to Bruce’s translation 
(see the bibliography) by chapter page (he does not have global page numbering). 

12 Briggs referred to these lemmas as axioms. These same properties, as well as the next 
one about the logarithm of a rational power, had been more clearly stated by Napier in the 
Appendix to the Constrvctio, pp. 50-51 of Macdonald’s translation. 
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by repeated application of the product rule that ifn is a positive integer, then 
log x” = nlog x. Also, if m is a positive integer and y = x’"/", then x” = y"” 
and m log x = nlog y, so that 


m m 
logx” = — logx, 
n 


which means that for a rational exponent logx” = rlogx. Thus, accepting 
the equation log 1 = 0, we get these very convenient properties that Napier’s 
own logarithms lack. However, Briggs rejected the second equation in favor 
of a new proposal made in the Appendix to the Constrvctio. This Appendix 
starts with the words [Macdonald translation, p. 48]: 


Among the various improvements of Logarithms, the more important is that 
which adopts a cypher as the Logarithm of unity, and 10,000,000,000 as the 
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Logarithm of either one tenth of unity or ten times unity. 


This is the suggestion that Briggs actually quasi-accepted, in the form 
log 10 = 10" [p. 3; 3-1]. The large number of zeros was intended to provide 
sufficien accuracy without resorting to numbers containing the still too new 
decimal point and digits. Soon thereafter, when decimal fractions were fully 
accepted, the best choice was log 10 = 1. This results in the same logarithms 
that Briggs computed but with a decimal point in their midst. 

But how are all these logarithms computed? At the time that Briggs was 
about to embark on the elaboration of his tables (Napier’s health was failing 
in his 65th year, so it was up to Briggs to start a new series of computations) 


JOHN NAPIER IN 1616 


Engraving by Samuel Freeman from the portrait 
in the possession of the University of Edinburgh. 
From Smith, Portraits of Eminent Mathematicians, Il. 


there were two main methods to calculate logarithms, and both of them had 
been published in the Appendix to Napier’s Constrvctio, as Briggs himself 
stated [p. 5; 5-2]. We do not know whether these are due to Napier or are a 
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product of his collaboration with Briggs at Merchiston. The firs method was 
based on the extraction of fift roots and the second on the extraction of square 
roots. Napier felt that “though this method [of the fift root] is considerably 
more difficult it is correspondingly more exact” [Appendix, p. 50]. Briggs 
chose the method of the square root, and started evaluating (by hand, of course, 
which may take several hours) the square root of 10 [p 10; 6—2]:'9 


10!/7 = /10 = 3.16227766016837933 199889354, 


so that log 3.16227766016837933199889354 = 0.5. Then, by successive 
extraction of square roots, he evaluated 10'/4, 10'/8, and so on, down to 


10'/2* — 1,000000000000000255638298640064707 
and [p. 10; 6—3] 


10'/2" — 1,000000000000000127819149320032345. 


Briggs’ values can be seen on the next page, reproduced from Chapter 6, 
page 10, of the second edition of Arithmetica logarithmica. Note that he did 
not use the decimal point, but inserted commas to help with counting spaces. 
On the logarithm side he omitted many zeros, which can be confusing. We 
know he chose the logarithm of 10 to be what we express as 10!*, but in this 
table it appears as 1,000. Down at the bottom, these logarithms are given with 
forty-one digits, making it difficul to know what an actual logarithm in this 
range is. To avoid possible misinterpretations, we follow the modern practice 
of using a decimal point for the numbers, but shall retain instead the firs 
of Briggs’ commas for their logarithms. If this comma is read as a decimal 
point, these are today’s decimal logarithms, while the Briggsian logarithms 
are 10'4 times larger. It must be pointed out that Briggs made a mistake in 
his computation of 10!/*. His digits are wrong from the twentieth on. This 
mistake trickles down through his entire table, but because the wrong digits 
are so far on the right, the error becomes smaller as it propagates, and his 
last two entries are almost in complete agreement with values obtained today 
using a computer. 

The end result of this stage of Briggs’ work is a table of logarithms, the 
table shown on page 107, displaying in the second column the logarithms of 


13 Briggs denoted the square root of 10 by /.10, meaning the /atus (side) of 10; that is, the 
side of a square of area 10. The square root of the square root of 10 would then be //.10, 
the eighth root /.(8)10, and so on. I have adopted our usual exponential notation, unknown 
at that time, as most convenient for our purposes. 


SECTION 2.3 


BriGGs’ LOGARITHMS 


107 


to 


Numeri continue Medy inter Denariam GV nitatem. 


D 4ARITHMTICA E 


to 


1 31622,77660,6837,93319,98893,5$ 


2 
3 
4 


17782,79410,03892,28011,97304,13 
1333321432, 10332,40256,65 389,308 
11547,81984,68945,81798,61918,213 


10746,07828,32131,74972,13817,6538 


10366,32928,43769,79972,90627,3131 
1o181,51721,718 18,18414,73723,8144 
10090,35044,84144,74377,5 9005, 139% 
1004.5,97354,25446,25156,64670,6113 


10022, f1148,29291,29154,65611,7367 


To041,24941,39987,9875?, 85395, 5L805 
1000§,62312,60220,86366,18495,91839 
10002, 811516,78778,01323,99249,64325 
ooo! 40548, 51694,7258%,62767,32715 


\£0000,70271,78941,143, 57,38811,70845 


Too00,35135,27745,18 566,0858 1,37077 
10000,17567,48442,267 38,33846,78274 


22 
er 


19000,08783,70363,46121,46574,07431 
10000, 04391,84217,31672,36281,88083 
10000,0219,91867,55 542,03317,07719 


10000,01097,95873,50204,09754,72940 


10000,00548,97921,68211,14626,60250,4 
10009,00274,48957,07382,9F091,25449,9 


24) £000,00137,24477, 595 10,93282,69572,5 


25 19000,00068,62238,55210,25737,18748,2 


26| 10000,00034,31119,22218,83912,75020,8 
27|10000,00017,15$59,59537,84719,93879,1 
28|10000,00908,57779,7945 1,03051,17588,8 
29| 10000,00004,28889,89633,54198,42901,3 
30] £0000,00002,14444,94793,77767342970,4 
31) 10000, 00005,07222,47391,14050,76926,8 
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38 
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£0000,00000,5361.1,23694,13317,14831,4 
T0000,00000,26$05,61846,70731,51508,7 
£0090,00900,13402,80923,26383,99277,7 
To000,00000,03 370,70230,79911,91730,0 
To000,00000,01675, 3511 $,39815,61857,6 
T0000,00990,90837,6757,69872,72426,9 
100090,09000,00418,83778,84927,59087,9 


10099,00000,00104,70944,71230,253II,0 
T0c00,00000,000§2,35472,35614,98950,4 
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10000,00000,00000,81804,255'56,48210,297 
19900,00000,00000,40902, 1277824 104,311 


£0009,00000,09000,2045 1,06389,1205 1,946 


T0900,00050,00209,41889,42461,60262,¢ 


£9090,90000,00006,54434,04451,85869,75° 


Logavithmi Rationales. 
T,000 
0,f0 
0,25 
0,125 
0,0625 
0,03125 
0,01 f62,5 
0,00781,25 
0,00390,625 
o,oorgs,3125 
0,00097,65625 
0,00048,32812,5 
0,00024,41406,25 
0,00012,20703,125° 
0,00006,103 51,5625 
'0,00003,0F175,78125 
0,00001, §2587,89062,5 
9,00000,76293,94531,2F5 
0,00000,38146,97265,625 
0,90000,19073,48632,8125 
0,00000,09§36,74316,40625" 
0,00000,04768,37158,20312,5 
0,00000,02384,18579,10156,2¢ 
0,00000,01192,09289, 55078, 125 
¥,90000,09596,04644,77539,0625 
0,00000,00298,02322,38760,53125 
0y00000,00149,01161,193 84,76 62,5 
0,00009,00074, f0580,59692,38281,25 
©,00000,00037,25290,29846, 19 140,625 
9,00000,00018,62645,14923,09570,3125' 
0,00000,00009,31322,57461,54785, 15625 
0,05000,00004,65661,28730,77392,57812,5 
©,00000,00602,32830,64365,38696,28926,25 
0,00000,0000] ,16415,32182,69348,14453,125 
0,00000,00690,58207,66091,34674,07226, 5625 
9,00000,00000,29103,83045,67337,03613,28125 
0,00900,00000,145 $1,91522,83668, $1806,64062,5 
9,00000,00000,07275,95761,4183 4,25903,3203 1,25 
0,00000,00000,03637,97880,70917,1295 1,660] 5,625- 
0,90000,00000,01818,98940,35458,56475,83007,8125 
9,00000,00600,00909,49470,17729,28237,01 J°3, 90629 
0,00000,00000,00454,7473 5,08864,64118,95751,05312 
9,09000,00000,00227,37367,54432,32059,47875,97656 
0,00000,00000,00113,68683,77216,16029,73937,98828 
9,00000,00000,00056,84341,88608,08014,86968, 99414 
9,00000,00000,00028,42170,94304,04007,43484,49707 
1,00000,00000,00014,2108 5,471 52,02003,71742,2485 3 
9300000,00000,00007,10542,73576, 01003 ,85871,12426 
9,90000,00000,00003, §'271,36788,00500,92935, 56213 
9,00000,00000,00001,77635,68394,00250,46467,78106 
9,00060,00000,00000,88817,84197,00124,23233, 89053 


———— 


£0000,00000,00000, 1022553194, 56025,921 L 9,90000,00000,00000,44468, 92098, 50062,61616,94526 
§2| 10000,09009,00000,05112;76597,28012,947 04 9,90000,00000,00000,22204,46049,25031,30808,47263 
53 j$9000,00000,c0000,02555,38298,54006,470 IN| 9,09000,00000,00000, 11 102,23024, 62515, 65404,23635 
54 109000,000c0,00060,01278,19149,32003,235 P|o,coong,c0000,00000,05 $$ 1,11 J12,31257,82702, 11855 
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the 54 weird numbers in the firs column. But we would prefer to have a 
table of the logarithms of more ordinary numbers such as 2, 5, 17, or—to 
be systematic—what about the logarithms of the firs 1000 positive integers? 
This was, precisely, Briggs’ original goal, and the logarithms of such numbers 
can also be obtained by the successive extraction of square roots. 

To see how this is done we can examine the last four rows of the reproduced 
table, those containing the entries labeled L to P. If we write P in the form 
1 + p with p = 0.0000000000000001278 1914932003235, ashort calculation 
shows that VN = 1+2p,M ~ 14+4p,and L © 1+8p. The approximation loses 
accuracy as we progress through these numbers, in reverse alphabetical order, 
so we shall not go any further. Now, log P is the last number in the right column 
of the table, namely 0,0000000000000000555111512312578270211815, and 
using the fact that p < | it is easy to see that 


log N = log(1 + 2p) © log(1 + p)* = 2log P, 


log M © log(1 + 4p) © log(1 + p)* = 4 log P, 
and 

log L © log(1 + 8p) © log(1 + p)® = 8 log P. 
This may be generously interpreted to mean that if a number of the form 1 +r 
is “near the numbers L M N & P” then 


log(1 +r) = log (1 + “p) x ” tog P. 
P Pp 


Briggs did not state any formulas and simply asserted that “the Logarithm of 
this number [a number in the stated range] is easily found, from the rule of 
proportion” [p. 11; 6—4], which he called the “golden rule” (auream regulam). 
He illustrated it with the number 1.0000000000000001, in which case 


r  10000000000000000 
p  12781914932003235 


and asserted that its logarithm is 0,000000000000000043429448 1903251804 
(the last three digits are in error). 

It may be more interesting to fin log2, although Briggs did not do it at 
this point in his book for reasons that will become apparent only too soon. We 
would start by computing 53 consecutive square roots of 2, ending with 


2'/2" — 1.00000000000000007695479593 116620. 
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This number is in the L to P range in the firs column of Briggs’ table, and 
then the golden rule (in which we have replaced * with = for convenience) 
and the stated value of log P give 


93 _ 07695479593116620 
~ 12781914932003235 


= 0,0000000000000000334210432288962932791931. 


log 2!/ 


Therefore, 


log 2 = (2°3)(0,00000000000000003342 10432288962932791931) 
= 0,30102999566398116973197. 


This method has two drawbacks. First, the accuracy is somewhat limited (the 
last six decimal digits are incorrect), although it may be considered acceptable 
for a table of logarithms. But the method is too labor-intensive. 4 

In Chapter VII, Briggs started exploring how to shorten his methods, and 
actually computed log 2 as pollows: First he noticed that it is sufficien to 
compute the logarithm of 1-224 ~ (since he did not use a decimal point, this is 
how he wrote 1.024) by the square root method, because, since 1024 is the 
tenth power of 2, 


log 1024 
log2 = = 
and 
log 1024 = log 1000 + log 1-04, = 3 log 10 + log 1-24. 


Since 1.024 is closer to unity than 2, then 47 (as opposed to 53) square root ex- 
tractions of 1.024 give the number 1.0000000000000001685 1605705394977, 
which is in the proportionality region L to P, and “the Logarithm of that num- 
ber is found by the golden rule 0,000000000000000073 1855936906239336” 
[p. 13; 7-3]. Thus, 


log 1.024 = (247)(0,0000000000000000731855936906239336) 
= 0,01029995663981195. 


Adding 3 log 10 and dividing by 10 yields log2 = 0,301029995663981195, 
as given in the Arithmetica logarithmica [p. 14; 7-4]. 


14 Bruce has estimated that the computation of about fift successive square roots is 
a process that may require between 100 and 200 hours of hard work [p. 7-6]. This is 
equivalent to several weeks if you have other commitments. 
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Then Briggs used the equation 5/10 = 1/2 and the properties of logarithms 
to obtain log 5 — log 10 = log 1 — log 2, from which [p. 14; 7-5] 


log 5 = log 10— log2 
= 1,000000000000000000 — 0,30102999566398 1195 
= 0,698970004336018805. 


Next, he gave a list of numbers whose logarithms can be easily computed from 
those previously computed using the properties of logarithms: 


From the multiplication of Two alone, by itself & into its factors, 4. 8. 16. 32. 
64. &c. Likewise of Five by itself & into its factors, 25. 125. 625.3125. &c, 
Two in factors of Five, 250. 1250. 6250. &c. Two into Ten 20. 200. 2000. 
40. 400. 80. 800. &c. 


The next prime after two is three, “whose Logarithm is most conveniently 
found from the Logarithm of Six,” since log 3 = log 6 — log 2. But six is very 
far from one, and it would take too many roots to bring it down to the L — P 
proportionality region. Thus, Briggs decided to fin the logarithm of 


69 
io? = 1.0077696 
instead, which he did again by the golden rule, and then obtained 
_ 7+ log 1.0077696— 
9 


log 6 


It should be evident that the elaboration of a complete table of logarithms 
by the square root method would be impossibly time-consuming. However, 
the logarithm of 6 was the last one that Briggs computed in this manner, for 
he made an additional discovery. He announced it at the start of Chapter 8 in 
the following terms [p. 15; 8—1]: 


We can fin the Logarithm of any proposed number according to this method, 
by continued Means, which supplies our need by findin the square root 
laboriously enough, but the vexation of this enormous labor is considerably 
lessened through differences. 


To illustrate this idea he chose the example of findin consecutive square 
roots of 1.0077696, the number involved in findin the logarithm of six. After 
computing some roots, as shown in the table reproduced on page 112 [p. 16; 
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8—2], he noticed that if each of them is written in the form 1 + A, then each 
value of A is approximately twice the next value. For instance, the A part of 
the root in line 43 of this table (0.00048384026884662985492535) is about 
twice the A part in the top line of box 42 (0.00024189087882468563808727). 
Then Briggs decided to investigate the difference between half of each A and 
the next A, which he called B. The firs of his B differences is seen in the 
third line of box 42, and it is a very small number in comparison with the A 
values under consideration. Using the next two A values, Briggs computed a 
second B, which is in the third line of box 41, and observed that it is about 
one quarter of the previous B. So he denoted by C the difference between this 
quarter and the second B, and noticed that C is very small in comparison to 
the B values. 

The fact that Briggs—or any of his contemporaries for that matter—did 
not use subscripts, added to the fact that his table contains only one column, 
may make things difficul for the present-day reader, so another table has been 
included on page 113 to show things in current notation. In its firs column, 
indicates the order of the root; in the second these roots are given in the 
form | + Ay, using the decimal point that Briggs omitted; and the remaining 
columns give the differences of several orders as computed by Briggs (some of 
his digits are incorrect, if we redo his work using a computer) and labeled by 
the headers. Leading zeros have been omitted, as originally done by Briggs, 
but without his vertical arrangement this may be confusing if not noted. 

What is clear is that these successive differences become smaller and 
smaller, to the point that 


1 
Fo = — Eg — Ey ~ 0 


with the 26-decimal-digit accuracy used in these computations. At this point 
Briggs got the idea of working backward through the differences. If we take 
F\o = 0, since Fjo is even smaller than Fo, then 


0= Fio= =e — Eo 
implies that E;9 = Eo = 0.0000,00000,00000,00000,00000,065. In the 
same manner, Dip = +. Do — E 9 = 0.0000,00000,00000,00000,02855,524, 
and so on, as shown in the lower table on page 113, until we obtain Ajo = 
0000,07558,20443,63012,14290,760, and then 1 + Ajo = 1+ Ao. It is 
evident that this method of evaluation of differences is much faster than directly 
findin the square root. Briggs illustrated this procedure again to compute A), 
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10077,696 
4.6|10038,77283,33696,24566, 38465, 51 
AS |10019,36766,13694,,66167, 5870229 
4-4.110009,67914,63909,90172,88907,20 
43] 10004,83840,25884,66298, 54025,35 4 
42110002,41890,87882,46856,38087,27 4 

254192051 344.2,3314.9,27402,67 i 
29525559386292,89375:40 Bp 

41) 10001,20938,12639,71345,94391,94 4 
120945143 9415234.28,19043,63 54 
7531301,52082,24651,69 B 
7231389,96573,22343,85 iB 

| 83,444.90,97692,15 


40 10900 ,60467,23 505,53096,8a160,05 

604.69,06319,85672,97195,97 id 

1,82814,32576,17035,92 B 

1,82825,38020,56162,92 1B 
T15054-44..39127,00 

II,OF§61,37211,52 3C 

116,98084,52 D 


39 10000,39233,16050,56577,59647,04 
39233,61752,76548,40080,02 14 
4.5702,19970,80432,08 B 
45703558144,04258,98 1B 
1,38173,23826,900 C 
1,38180,54.890,87 $C 
7531063,97 D| 

7531130,28 ~D 

66,31 E 


38 10000,1§116546599,90567,29504, 88 4 
T5116, 58025 ,28288 79823, 97 tA 
1¥425,37721,50319,09 B 
¥ 11425554992,70108,02 3B 
a ihe Cc 

17271,654.78,26 
Hucufgue Differentie mi- Mito 2 
nores funt inyenta per fub- 45591,50 =D 


Ss 


d :&ionem majorum é par- 2,07 E 
t bus homugenearum pre- 2,07 E 
cedentium, 7 


— 
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(his own table of backward differences, which was printed right below that 
of forward differences, shows a similar vertical arrangement of the A to D 
differences).1° 


15 Briggs gave a second method for computing the differences, which is based on evalu- 
ating successive powers. Fix a value of n and note that if 1 + A, = ./1 + Ay_1, then 


An-1 = a “F An)” —1, An-2 =(1+ An—1)" -1l= @ =F An)* —i, 


and so on. Therefore, 


1 
By, = = An-1 a An = = at + A,r — 1] - An = 5 An 


and 


Cn = Bn 1 Bn = alz4e 2 An-1| 5A 

= ,[+4,)*-1]- [a +A,/ -1)- 5a 
il 3 I 4 

~~ 5 Ant gin: 


Similarly, we would obtain 


_7 7 ; 
= GA, + g4n tant SAL +2 “AR 
ie 6 4 175 47, 1605 yg, 715 yo , 301 aio 
En = gy Ay + TAy + 6 An + ag An + Gy An t og An 


and higher-order differences. We know that Briggs did not use subscripts, but in writing 
these equations he did not even use the letter A. The stated equations were printed in the 
Arithmetica logarithmica as 


Secunda 2) 

Tertia £3 }4+y 

Quarta % WOHDa8) 

Quinta 246)+7(/6)+101% APO) 3@) 


He went down to “Decima” but he never wrote beyond the tenth power of A,. 

This method is not faster than the previous one, but it is of some theoretical interest, if 
only because of what Briggs could have done with it but didn’t. Using the equations in the 
header of the lower table on page 113, we see that 


(1+ An)? = 14 Angi = 14 5An — Brot = 14+ 54n — (4B — Ca41) 
= 14+ 4A, — 4B, + ($Cn— Dn4t) 

1+ 4An — 4Bn + 4Cn — (Dn — En+1) 

1+ 4A, — 4B, +4C,— Dnt bEn— 
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Giving a full account of the rest of Briggs’ methods to save time in the 
computation of logarithms would bea very long story. In Chapter 9 he used the 
methods already explained and some clever ways of writing certain numbers 
as sums and products to evaluate the logarithms of all the primes from 2 to 97. 
Then the rules of logarithms provide those of whole composite numbers, and 
in Chapter 10 he dealt with fractions. In the next three chapters, Briggs 
presented several methods of interpolation, starting with simple proportion in 
Chapter 11. In Chapter 12 he considered a region of the table in which second- 
order differences for successive numbers whose logarithms are known remain 
nearly constant, and developed a method to fin nine logarithms of equally 
spaced numbers between every two of those successive numbers. This is 
known today as Newton’s forward difference method because it was later 
rediscovered by Newton (it is briefl described in Section 4.3). 


With the aid of the method of finit differences of Chapter 8, Briggs com- 
puted a firs table of the new logarithms of the numbers | to 1000 in Loga- 
rithmorum chilias prima. In 1620, between this publication and that of the 
Arithmetica logarithmica, he had become the firs Savilian professor of ge- 
ometry at Oxford—a chair endowed by Sir Henry Savile. As we have already 
mentioned, the 1624 edition of the Arithmetica logarithmica contains no log- 
arithms of the numbers between 20,000 and 90,000. The reason for this is that 
the methods of Chapter 12, used to compute the printed logarithms, are not ap- 
plicable to this range. Briggs proposed a new method in a longer Chapter 13, 
bearing the following very long title: 


Bringing now to the right-hand side the values obtained for B, to E, by Briggs’ second 
method, we obtain 


(1+ An)? = 14+ 4An— gA2 + 3 (543 + 7 A4) 
—i(Z At + zA0+ 7 TEAS +4 A) +¢ aA’) 
21 45 6 i as 1605 48 
5 (As + 7Ao + TBAT + BAR +...) 
1 42 3 5 4 5 
An gAn + Tae 128 An + sean 


+35 


Nie 


ifi_ 1 
= 14+5A,+ 203 - ly ag 2(3 ts 2) 43 Sons 


Today we recognize this as the binomial series expansion of (1 + A,)!/?. Although we 
may like to think that this particular case of the binomial theorem was implicitly contained 
in Briggs’ work (this was firs pointed out by Whiteside in “Henry Briggs: the binomial 
theorem anticipated,” 1961), the fact is that Briggs, who had bigger fis to fry at that point, 
was not aware of it. 
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It is desired to fin the Logarithms of any Chiliad [that is, one thousand 
consecutive numbers]. Or given any set of equidistant numbers, together with 
the Logarithms of these four numbers, to fin the Logarithms of intermediate 
numbers for each interval [between the given numbers]. 


The second paragraph begins as follows: 

Take the first second, third, fourth, &c. differences of the given Logarithms; 
& divide the firs by 5, the second by 25, the third by 125, &c; ... the 
quotients are called the first second, third, &c. mean differences, ... 


He hoped to use it to compute the logarithms of the missing chiliads. He 
expressed this wish, four years later, in a letter to John Pell of 25 October 
1628: 17 


My desire was to have these Chiliades that are wanting betwixt 20 and 90 
calculated and printed, and I had done them all almost by myselfe, and by 
some frends whom my rules had sufficient! informed, and by agreement the 
business was conveniently parted amongst us : but I am eased of that charge 
and care by one Adrian Vlacque ... But he hathe cut off 4 of my figure 
throughout : and hathe left out my dedication, and to the reader, and two 
chapters the 12 and 13, in the rest he hath not varied fromme at all. 


What happened is that the large gap in Briggs’ Arithmetica logarithmica of 
1624 was fille by the Dutch publisher Adriaan Vlacq (c. 1600-1667) (or 
Ezechiel de Decker, a Dutch surveyor, assisted by Vlacq), but reducing the 
accuracy of the logarithms from 14 to 10 digits and eliminating Chapters 12 
and 13, the summit of Briggs’ work on finit and mean differences. Their 
work was published, without any notice to Briggs, in a second edition of the 
Arithmetica logarithmica. 

Briggs turned his attention to a new project, the Trigonometria britannica. 
After expressing his displeasure about the omission of Chapter 13 in the Dutch 
edition of the Arithmetica logarithmica, he included his method of mean dif- 
ferences in Chapter 12 of this project. Briggs would work on it for the rest 
of his life but could not finis it. He left it in the hands of his friend Henry 
Gellibrand (1597-1636), professor of astronomy at Gresham College, who 
was able to complete it before his premature death and published it in 1633. 


16 This and the preceding interpolation methods of Chapters 11 and 12 can be seen in 
modern notation in Goldstine, A history of numerical analysis from the 16th through the 
19th century, 1977, pp. 23-32. See also Bruce’s translation and his notes to these chapters. 

17 From Bruce, “Biographical Notes on Henry Briggs.” p. 7. 
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2.4 HYPERBOLIC LOGARITHMS 


The loss of Biirgi’s tables was not the only effect of the Thirty Years’ War on 
the development of logarithms. A Belgian Jesuit residing in Prague, Grégoire 
de Saint- Vincent (1584-1667), had previously made a discovery while inves- 


GREGOIRE DE SAINT-VINCENT IN 1653 
Engraving by Richard Collin. 
From Opus geometricum posthumum ad mesolabium, Ghent, 1688. 


tigating the area under the hyperbola that would eventually greatly facilitate 
the computation of logarithms. But its publication was delayed because Saint- 
Vincent fle from Prague (the Swedes are coming!) in 1631, at the start of the 
third phase of the Thirty Years’ War. 

King Gustavus Adolphus II had landed with his troops on the coast of 
Pomerania in 1630 and set about to the invasion of central Europe. He won a 
brilliant victory at the Battle of Leipzig on September 17, 1631, but later died 
at the battle of Liitzen on November 16, 1632. The Swedes were victorious, 
but Gustavus Adolphus was mortally wounded. In his haste to depart, Saint- 
Vincent left all his papers behind, but they were returned to him about ten 
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years later, and his research on the squaring of the circle and the hyperbola 
was published in 1647 in a book of over 1250 pages: Opvs geometricvm 
quadratvre circvli et sectionvm coni. 

The result on the hyperbola mentioned above is contained in several propo- 
sitions in Book VI, in particular in Proposition 109, on page 586, which is 
reproduced here. The text of the proposition can be translated as follows: 


PROPOSITIO CIX. 


Sit AB, AC afymproti hyperbole DEF : diuisaque AC, ve AG, 
AH,ALAK,AC continue (int proportionales, ponantur G D,EH, 
LI,MK,FC, ipfi AB zquidiftantes. 
Dico HD, IE , KL,CM 
fegmenta effe equalia. 


Demonfiratio. 


Voniam AG» AH, AI ,d- 
QNrroportionales fant, D GEHe 
LI, &c, camdem ge Hgeat 

continuanterationem; vnde Ln 
fia pie anand Sy : EI ffegmenta fant xequalia, Simf- 
a tercim EH, LI,MK, EC in cadem fiat proportione, ¢ xqualia quoque fant 
$2 gmenra EI, LK,MC., Contac igitur veritas propofitionis. 


Let AB, AC be the asymptotes of the hyperbola DEF : break up AC, as AG, 
AH, Al, AK, AC [so that they] are in continued proportion, place GD, EH, 
LI, MK, FC, equidistant from [meaning parallel to] AB. 

I say that HD, IE, KL, CM are equal [area] patches. 


In our terms, this means that if 


AH Al AK AC 
AG AH Al AK 


then the hyperbolic areas DGHE, EHIL, LIKM, MKCF, ... are equal. 
In other words, if we denote the ratio AH/AG by r, the abscissas AG, 
AH = AGr, 


Al = AHr=AGr*, AK =Alr=AGr?, AC =AKr = AGr'", 
and so on, form a geometric progression, and then the hyperbolic areas DGHE, 


DGIL, DGKM, DGCF, ... form an arithmetic progression. If AG = 1, this 
is the same type of relationship that Michael Stifel had pointed out between 
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successive powers of 2 and the corresponding exponents (page 83). We can 
conclude, in current terminology, that the relationship between the area under 
the hyperbola from x = | to an arbitrary abscissa x > 1 and the value of x is 
logarithmic. Saint-Vincent did not explicitly note this, but one of his students, 
Alfonso Antonio de Sarasa (1618—1667), did in his solution to a problem 
proposed by Marin Mersenne, a Minimite friar, in 1648. The next year, in 
his Solvtio problematis a R P Marino Mersenno Minimo propositi, de Sarasa 
stated the problem in this way:'8 


Given three arbitrary magnitudes, rational or irrational, and given the loga- 
rithms of two, to fin the logarithm of the third geometrically. 


De Sarasa solved Mersenne’s problem in developing his Proposition 10, 
which simply restates a particular case of the problem as follows: 


Given three magnitudes, A, B, and C, which can be shown in one and the same 
geometric progression, and given the logarithms of two of these magnitudes, 
say those of A and B, to determine the logarithm of the third, C, geometrically. 


That is, de Sarasa assumed that A, B, and C are terms of a geometric progres- 
sion, because this is the only case he thought he could solve. They are shown 
in the next figur as ordinates GH = A, 1K = B,and LF = C ofa hyper- 


A 


B Q 
Cc 


6 8 10 12 14 16 18 


F N S V K YHE 


bola, and we can think of the remaining shown ordinates as equally spaced 


18 This and the next two quotations are from Burn, “Alphonse Antonio de Sarasa and 
logarithms,” 2001. This paper contains a detailed analysis of de Sarasa’s Solutio. I follow 
a small part of Burn’s presentation here since I have been unable to see the original work. 
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terms of the same geometric progression. If some ordinates of the hyperbola 
y = 1/x (de Sarasa did not give an equation) are in geometric progression, 
then the corresponding abscissas are also in geometric progression. Then, by 
Saint-Vincent’s Proposition 109, the area K L is four times the area NL, and 
the area GK is twice the area NL, at which point de Sarasa observed: 


Whence these areas can fil the place of the given logarithms (Unde hae 
superficies supplere possunt locum logarithmorum datorum). 


This observation allowed de Sarasa to solve the modifie Mersenne prob- 
lem, which can be done quickly in current notation. Assume that there are 
real numbers a > 0 and 0 <r < 1, and positive integers m and n such that 


GH =ar", XY =ar™*", IK =ar™**", and) LF =ar™*®". 


Let S denote the area under the hyperbola between any two of these consecutive 
ordinates. For example, NL = S. If we refer to the exponents m, m + 2n, and 
m + 6n as the logarithms of A = GH, B = 1K, and C = LK, respectively, 
then it is clear that the differences of these logarithms, 2n and 4n, are equal 
to n times the area ratios GK /S and K L/S. We can write this as follows: 


GK 
log B— log A ee 


and 
KL 
log C — log B = ar a 

As an example, de Sarasa considered the case in which the numbers 6 to 18 
shown in his figur_ are the logarithms of the ordinates GH to LF. Thenn = 2, 
and the shown ordinates represent every other term of their progression. We 
have log A = 6, log B = 10, and KL/S = 4. It follows that log C — 10 = 8 
and log C = 18. 

What is important to us is de Sarasa’s realization that areas between the 
hyperbola and the horizontal axis are like logarithms. While he kept things 
general, we need to specify. Thus, consider the hyperbola y = 1/x and let 
A(x) denote the area under it from 1 to some x > 1. It can be called, at 
least for now, the hyperbolic logarithm of x. If we denote this logarithm by 
Hlog, we have A(x) = Hlog x. Since we have A(1) = Hlog 1 = 0, then the 
hyperbolic logarithm shares this property with the Briggsian logarithm. Is it 
the Briggsian logarithm? We shall have the answer shortly. 

The Scottish mathematician James Gregory (1638-1675) elaborated on 
the work of Saint-Vincent and de Sarasa and computed a number of hyper- 
bolic logarithms in a 1667 short book published in Padua, where he resided 
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at the time: Vera circvli et hyperbole qvadratvra, in propria sua proportio- 
nis specie, inuenta, & demonstrata (page references are to this original). In 
Proposition XXXII [p. 46], Gregory posed the problem of findin areas un- 
der the hyperbola D/L with asymptotes AO and AK. To do that he chose 


MHF 


the lengths 7K = 1000000000000, LM = 10000000000000 = 10/K, and 
AM = 1000000000000 = / K (Gregory’s figur is clearly not to scale). Then 
he employed a series of polygons with an increasing number of sides, inscribed 
and circumscribed to the hyperbolic space LIKM [pp. 47—48], and was able 
to approximate its area to be 23025850929940456240178700 [p. 49]. 
Gregory was, at the time, using very large numbers to avoid the still unfa- 
miliar decimal point. If we choose a positive x-axis in the direction AO and 
a positive y-axis in the direction AK (a choice that is as good as the opposite, 
given that both are asymptotes of the hyperbola), we would write the equation 
of Gregory’s hyperbola as y = 10*>/x. If instead we divide all the y-values by 
10! and all the x-values by 10!3 (which is equivalent to the choice 1K = 1, 
LM = 10, and AM = 0.1), then the equation of the hyperbola becomes the 
familiar y = 1/x, and the area stated above must be divided by 107°.'9 Thus, 


19 Gregory himself made the choice JK = 1 and LM = 10 in Proposition XXXII, 
entitled: Jt is proposed to find the logarithm of any number whatever [p. 49]. 
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LIKM = 2.3025850929940456240178700. Notice that for y = 1/x we have 
AK = AP = 1, and therefore each of the rectangles JQ@MK and LQ PO has 
area 0.9. Then the hyperbolic areas LJKM and ILOP are equal. The fig 
ure clearly shows that LO P = Hlog AQ, that is, that LIK M = Hlog LM. 
Therefore, since LM = 10, 


Hlog 10 = 2.3025850929940456240178700. 


It is clear from a comparison of this value with the Briggsian logarithm of 10 
that Hlog is a new logarithm. One, we must say in a spoiler mood, that would 
go on to a position of prominence in the future. 

In the few remaining pages of this book Gregory showed how to evaluate 
additional hyperbolic logarithms [pp. 53-55] and considered the problem of 
findin a number from its logarithm [p. 56]. 

At the end of his stay in Italy, in 1668, Gregory published a second book: 
Geometrie pars vniversalis (The universal part of geometry). In the preface, 
on pages six to eight, he gave the firs graph ever of the hyperbolic logarithm 
(shown below, with the positive x-axis in the direction OL and OC = 1) and 
stated for the firs time in print the properties of this curve. 


2.5 NEWTON’S BINOMIAL SERIES 


Sometime between the publication of de Sarasa’s and Gregory’s works, a 
young Isaac Newton (1642-1727) was fooling around with Pascal’s triangle 
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IsAAC NEWTON IN 1689 


Portrait by Sir Godfrey Kneller. 
Farleigh House, Farleigh Wallop, Hampshire. 


(which was known way before Pascal), giving the coefficient of the expansion 
of (a+ b)" forn = 0,1,2,... This triangle can be rewritten in a rectangular 
arrangement, with all its entries moved to the left and zero-fille on the right, 
as shown in the next table: 


n=0 1 0 0 0 0 0 
n=1 1 1 0 0 0 0 
n=2 12 1 0 0 0 
n=3 1 3 3 1 0 0 
n=4 14 6 4 1 #0 


This had already been done by Stifel in his Arithmetica integra, and Newton 
wrote it also as a rectangular array, but turning the rows into columns, in some 
manuscripts now preserved at the University Library, Cambridge, but never 
published in his lifetime. The second of these manuscripts, which started as 
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a redraft of the firs and was possibly written in the autumn of 1665,2° is 
reproduced on page 126, and we shall present firs the result obtained on the 
top third of the page: the computation of the area under the hyperbola 


_ 1 
~ 14x 


from the origin at d (it looks like 0 in Newton’s hand) to an arbitrary point e 
at some x > 0. Newton made the following observation about his vertical 
arrangement of Pascal’s triangle: 


be 


The composition of w" table may be deduced from hence, viz: The sume of 
any figur & y° figur above it is equall to y° figur following it. 


Referring instead to the horizontal arrangement by Stifel, which is more famil- 
iar to us, this can be translated as follows: any entry m, other than the first in 
a given row is the sum of two entries in the previous row: the one above m and 
the one to its left. Viewing things in this way is important because it allowed 
Newton to construct a new column to the left of the one for n = 0 with the 
same property. If we insist on a horizontal arrangement, this gives us a new 
row above the one for n = 0, as follows: 


r1= 


Ss 8s 3 8 
II 

BRwWNrOS | 
ll ll 
BRWNK Oe 
NWe OC 
he cCoOof. 
rPeoocoor 
oe oq oo Oo — 


Notice that now there is no zero-fil on the right in the new firs row. It 
continues indefinitel as an unending string of alternating 1’s and —1’s. If the 
new row is valid, it gives the following expansion: 


eth =a" =e b40- Fe be Pe 
and, in the particular case a = 1 and b = x, we obtain the infinit series 


(+x) }=1—-x4+x? e424 x4 


20 It appears under the modern title “Further development of the binomial expansion” in 
Whiteside, The mathematical papers of Isaac Newton, 1, 1967, pp. 122—134. The table in 
question is on p. 122. 
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This is the firs instance of what we call today Newton’s binomial series, and 
it is known to be a valid expansion for small values of x. Newton did not write 
this expansion explicitly in the manuscript under discussion, but it is clear that 
this is what he had in mind when he computed the area under the hyperbola 
be =(1+x)7!. 

To do that, he considered firs the graphs of the polynomials 


1 il+x, 142x+2x7, 143x43xx+2x3, &c. 


(in the style of his time, Newton frequently wrote xx instead of x”), whose 
coefficient are given by the other rows of the table above, and stated that the 
areas under these graphs from 0 to x > 0 are given by 


xXx 2xx x 3xx° 3° _% &e2! 
Xi; Beng a eg Fg Bag Oe ge Cc. 
And then, making a leap of faith, he assumed that the same procedure would 
apply to findin the area under (1 + x)~!, represented by the infinit series 
whose coefficient are given by the entries in the top row of the extended table. 
Newton concluded with the following statement: 


By w*" table [this is where he implicitly assumes the series expansion for 
the hyperbola] it may appeare y' y° area of the hyperbola abed [meaning the 
area under y = (1+ x)7! from 0 to x > OJis 


If we note that Newton’s hyperbola, y = 1/(1+ x), isa translation of y = 1/x 
to the left by one unit, then the area that Newton obtained is, in the notation 
introduced in Section 2.4, the same as the area A(1 + x) under y = 1/x from 
x = 1to1l+.x > 1. Thus, we can rewrite Newton’s discovery as 


2 3 4 5 6 


x. Z 
Hl 1 = A(1 = + &c. 
og (1+ x) (l+x)=x ae ats F c 


Newton obtained this unpublished result 


between the years 1664 & 1665. At w*" time I found the method of Infinit 


21 Newton had already figure out (although he was not the firs to do so) that for n = 
0,1,2,... the area under y = x” between 0 and x > 0 is x"! I(n +1). 
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From Whiteside, The mathematical papers of Isaac Newton, I. 
Facing the title page. 
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series. And in summer 1665 being forced from Cambridge by the Plague 22 
I computed y° area of y° Hyperbola at Boothby in Lincolnshire to two & fift 
figure by the same method.?9 


Then, in a manuscript probably written in 1667,24 Newton considered 
again the area under the hyperbola y = (1 + x)~!, represented in the next 
figur with the origin at b and in which ab = bc = 1. After restating the area 


under this hyperbola as an infinit series and making some calculations, he 
made the following statement [p. 186]: 


Now since the lines ad, ae, &c: beare such respect to y° superficie [areas] 
bcfd, bche, &c: as numbers to their logarithmes; (viz: as y° lines ad, ae, 
&c: increase in Geometricall Progression, so y* superficie bef d, bche, &c: 
increase in Arithmeticall Progression ): Therefore if any two or more of those 
lines multiplying or dividing one another doe produce some other like ak, 
their correspondent superficies added or subtracted one to or from another 
shall produce y* superficie begk correspondent to y' line ak. 


To interpret this statement in today’s notation let x and y be the abscissas of the 
points d and e, respectively, so that the lengths ad and ae are 1+.x and1+y. 


22 This was the great plague that took nearly 70,000 lives in London alone, and Cambridge 
University was closed. 

23 Newton made this statement on July 4, 1699, in one of his notebooks containing old 
annotations on John Wallis’ work. It is quoted from Whiteside, The mathematical papers 
of Isaac Newton, I, p. 8. 

24 Reproduced in Whiteside, The mathematical papers of Isaac Newton, II, 1968, pp. 184— 
189. Page references are to this printing. 
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In view of the previous interpretation of areas under this hyperbola as values of 
H log, the areas bcfd and bche are H log (1 + x) and Hlog (1+ y). Then, ifak 
is the product of the lengths ad and ae, the area bcegk is Hlog(1+ x)(1+ y), 
and Newton’s statement is that 


Hlog(1++x)1+y) =Hlog(1+.x«)+Hlog(1+y). 


Similarly, 
+x 


H1 
087 


= Hlog(1 +x) —Hlog(1+y). 


Thus, what Newton gave us in this manuscript is a statement (possibly the firs 
ever) of the properties of hyperbolic logarithms. He illustrated these rules by 
the computation of a number of hyperbolic logarithms to 57 decimal figure 

[pp. 187-188]. Finally, to show that he could trust these rules, he computed 
the hyperbolic logarithm of 0.9984 in two ways: firs by repeatedly using the 
stated rules with 


2x2x2x2x2x2x2x2x3x 13 
10000 


and then writing it as 0.9984 = 1 + (—0.0016) and using the infinit series. 
These two results (the firs contains a minor error) agreed “in more y" 50 
figures [p. 189]. 

Newton, who did not publish the preceding work, may have been dis- 
appointed in 1668, when other mathematicians published their work on the 
quadrature of the hyperbola using infinit series. The first in April, was 
William, Viscount Brouncker (1620-1684), at the time the firs president of 
the Royal Society.2° The hyperbola y = 1/x is represented by the curve EC 
in the next figure in which AB is a segment of the x-axis from x = 1 tox = 2, 
and the positive y-axis, not shown, is directed down. The area ABCdEA is 
bounded between the sum of the areas of the parallelograms with diagonals 
CA, dF, bn, fk, ap, cm, el, gh, &c. and the area of the parallelogram ABDE 
minus the sum of the areas of the triangles (not explicitly drawn) EDC, EdC, 
Ebd, dfC, Eab, bcd, def, fgC, &c. In this manner Brouncker was able to 
conclude that 


0.9984 = 


1 1 1 1 1 1 
ABCdEA = + t t &e. 
ae [xo DMO Sed Seco ee ee 


25 “The squaring of the hyperbola, by an infinit series of rational numbers, together with 
its demonstration, by that eminent mathematician, the Right Honourable the Lord Viscount 
Brouncker,’ 1668. 
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in infinitum [p. 646]. This is what we now call the logarithm oftwo. Brouncker’s 
method can be generalized, but it is sufficient! unappealing to discourage such 
a course of action. 

Next, Nicolaus Mercator (1620—1687)—born in the province of Holstein, 
Denmark (now Germany), with the last name Kaufmann but working in 
England—published the quadrature of the hyperbola in his 1668 book Log- 
arithmotechnia: sive methodus construendi logarithmos nova, accurata, & 
facilis. Like Newton, but independently, Mercator based his quadrature on 
the infinit series for the quotient 1/(1 + a), which he gave on page 30 using 
a as the variable. Using this series and referring to the next figure in which 


AI = | and JE is“divisa in partes equales innumeras” of length a, he found 
the start of a series for the sum of the altitudes ps + gt + ru +---, which 
multiplied by a gives the area B/ru [Proposition XVII, pp. 31-32]. Mercator 
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did not explicitly give the logarithmic series, but it can be readily obtained 
from this, and two men showed how to do it. The firs was John Wallis,2° who 
showed that, definin A = /r, the hyperbolic space B/ru is equal to the sum 
of the series 


A—4A?+ 7A3—4AS4 4A, &. 


[p. 754].2” The second commentary on Mercator’s work was published later in 
the same year by James Gregory in his Exercitationes geometric, 1668. He 
devoted the second Exercitatio, entitled N. Mercatoris quadratura hyperbole 
geometricé demonstrata, to prove Mercator’s quadrature of the hyperbola us- 
ing term-by-term integration. Here, in Consectario 2 (what we call corollaries 
he called consectaria, meaning conclusions or inferences) to Proposition ILI 
[p. 11] he gave the logarithmic series as follows. Referring to x as the “primus 
terminus” of his expansion, he stated it as: “primus terminus — 5 secundi [the 
second power of x] + i tertii — 7 quartii + &c. in infinitum. 

The next advance in the computation of logarithms and in determining the 
exact nature of this elusive “Hlog” was also based on work of Newton: his 
generalization of the binomial series to fractional exponents. He stated his 
discovery in a letter of June 13, 1676, to Henry Oldenburg, secretary of the 
Royal Society, in response to an indirect request from Leibniz, who wished 
to have information on Newton’s work on infinit series. It was in this letter 
that Newton introduced the notation a for a number raised to a fractional 
power. In a subsequent letter of October 24,78 he endeavored to explain how 
he arrived at the statement of his theorem, for which he never provided a proof, 
recalling his original work with the aid of some old manuscript. 

He explained [p. 130] how at the beginning of his study of mathematics, 
he “happened on the works of our most Celebrated Wallis,” in particular, on 
his Arithmetica infinitorvm of 1655.7° In this work Wallis had managed to 


26 “| ogarithmotechnia Nicolai Mercatoris,” 1668. 

27 An abbreviated explanation in English was given by Edwards in The historical devel- 
opment of the calculus, 1979, pp. 162—163. 

28 Page references given below for this letter refer to the English translation in Turnbull, 
The correspondence of Isaac Newton, Il. See the bibliography for additional sources. 

29 Wallis was the firs to use of the symbol oo for infinit . It is in Proposition 1 of the 
First Part of De sectionibus conicis of 1655, p. 4, where he stated esto enim oo nota numeri 
infiniti (let oo be the symbol for an infinit number); reproduced in Opera mathematica, 1, 
1695, p. 29. It also appeared in Proposition XCI of Arithmetica infinitorvm of 1655, p. 70, 
in which he stated ... erit oo vel infinitus(... will be oo or infinity) reproduced in Opera 
mathematica, 1, p. 405. Also in Stedall, The arithmetic of infinitesimals. John Wallis 1656, 
2004, p. 71. 
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JOHN WALLIS IN 1658 
Savilian Professor of Geometry at Oxford. 
Portrait by Ferdinand Bol. 
Photograph by the author from the original 
at the Musée de Louvre, Paris. 


fin the areas under the curves that we now express by the equations 
y=(1—x°), y=(1—x’)', y=(1-x??, y= = 2’), 
and so on, from the origin to x > 0, which are, in current notation, 
xy X= 5x3, x — Sx3 ix, x 3x3 + 2x° — 5x", 


: 1 . 
and so on.°° To fin the area under the circle y = (1—x?)2, he noticed that the 
exponent is the mean between 0 and 1, and attempted to use his new method 


30 Tn Wallis’ notation, these can be seen in Proposition CXVIII of the Arithmetica infini- 
torvm. In Opera mathematica, 1, p. 415. In Stedall, The arithmetic of infinitesimals. John 
Wallis 1656, p. 88. 
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of intercalation to fin it between the firs two areas stated above. This led 
him to a most interesting formula for ,3" but in the end he was unable to do 
the intercalation. 

Newton’s power of observation, when he happened on Wallis’ work, made 
him notice that in all the expressions for the areas given above [p. 130],°4 


the firs term was x, and that the second terms ae 143, ae 3x3 &c were 


3 
in Arithmetic progression, and hence 


the firs two terms of the areas to be intercalated, those under the graphs of 
y=(1- x2)2, y=(1- x2)3, y=(1 ~ x22, and so on (why stop with the 
circle), “should be 


x = & > Ree. Ber 


From his study of the area under the hyperbola, Newton was already sure that 
these areas would be given by infinit series, so he had to fin the rest of the 
terms. Wallis’ denominators from the third on are 5, 7, etc., which Newton 
kept for his intercalated series. Next he looked at the numerators of Wallis’ 
coefficients including that of the x term, and discovered that 


these were the figure of the powers of the number 11, namely of these 11°. 
111. 117. 11°. 114. that is firs 1. then 1,1. third 1,2,1. fourth 1,3,3,1. fift 
1,4,6,4,1, &c.38 


The firs figure to use Newton’s own word, in a power of 11 is always 1, and 
what he sought was a method to determine all the remaining figure if given 
the firs two. Here is his own description of the discovery that he made: 


31 Proposition CXCI of the Arithmetica infinitorvm. In Opera mathematica, 1, p. 469. 

321 shall closely follow Newton’s own recollections in his October 24 letter but will 
replace his expression | — xx | with (1 — xx). In his original work of 1665, already quoted 
at the start of this section, Newton expressed himself less clearly than in his recollections. 


33 Newton used the notation Ti’, rather than 11°, and so on for the other powers. Note 
that the word “figure cannot be interpreted to mean “digit” starting with 115 = 161051; 
but it means each of the coefficient of the powers of 10 in the expansion 11° = 1 x 10° + 
5x 104+10x 103+10x 107+5 x 10! +1 x 10°, that is, the Pascal triangle coefficient 
in the expansion of (10 + 1)°. 
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. and I found that on putting m for the second figur [ina power of 11], 
the rest would be produced by continual multiplication of the terms of this 
series. 


m-O m-1l m-2 m—-3 m-—4 
x x x x &e. 
1 2 3 4 5 


—1 


E. g. let m = 4, and 4 x ——— that is 6 will be the third term [fig 


ure], & 6 x ~~ that is 4 the fourth, and 4 x “—™ that is 1 the fifth 


m : ‘ ¢ Fi 2 : 
& 1x that is 0 the sixth, at which place in this case the series ends. 


This procedure gives the figure of 11+. 
Assuming that the same rule applies to fractional exponents, which is a 
stretch to say the least, Newton considered the case m = s, and stated [p. 131]: 
. since for a circle the second term was < I putm = , and the 


terms appearing were 


1 3-1 l 1 4-2 tf. «4 wuss 5 
a og ge a ae Se iG es 
& so to infinit . From which I learned that the desired area of a segment of 
a circle is is 2 - a 
eB Be. Me eg, 
) 7 9 


And by the same reasoning the areas of the remaining curves to be inserted 
came forth... 


The area under the hyperbola was relevant to our discussion of logarithms, 
but would the remaining areas be equally relevant? Not really; it is Newton’s 
next insight that matters: 


But when I had learnt this I soon considered that the terms 


i) 2 4 6 
(l—xx)?. (l—xx)2?2. (l—xx)2. (l—xx)2. &c 


that is 1. 1—xx. 1—2xx+x*. 1 —3xx + 3x4 — x® &c could be 
interpolated in the same way as the areas generated by them [the ones found 
by Wallis] : and that nothing else was required [for this purpose] but to omit 
the denominators 1, 3,5, 7, &c ... 


He forgot to say that we must also divide by x. This is what we now call term- 
by-term differentiation. Newton applied this procedure to his newly found 
areas to obtain the curves themselves. 


134 LoGaRITHMS CHAPTER 2 


nel nn” ‘ite tae ya ta ft eee 


sa +t x5 - 
—K : s — 
| ee £ z i] z y 2 zt x? , tx«*, at 


an % 
Tdlen prayers ad a Fn BS. — ae - eS 3 
“ ee pa con dewnbon y2? Yenoms naloves 1.3. 
Ey PRPs a prey Taare the 2R rumsvate 
foils emsorneg plant wurigatin. Hib saline 
- “ eae ; naa “al 
alee et, he eis peek ti ate as Lpa 
We, We " 18, 6, 4,1. be. Quarham 4 RE Ry i 
be 1,3,3,) st pe dacablos prvmts f he rrwow ossind, 
we 2 pecunda m, vi Gyae i ieebatctts: 
“ae Perea Se lh plcelionan Lenn oan. hajms Serisy, 
mot y ™>3 mod y BaF ve, L gad m= 4, Bxnt yas 
2 me hoc yb 4 gravis, A 4y 23 =? Loe, 
he sla eV KEEE fae oh on 1p l = Bt a 


Hel ekg in ie pete ie 


BY ea ted hum. Wide 
= eta sate 3 oe wo — &~’ ie. ee 


A portion of the firs page of Newton’s October 1676 letter to Oldenburg. 
From Turnbull, The correspondence of Isaac Newton, II. 


Thus as e.g. (1 — xx)? would have the value 


1.4 1,6 
X“ = gx" — 7gx® &e 


and (1 — xx)? would have the value 


1—3xx+3x4+5x° &c 
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and (1 — xx)3 would have the value 


1 


But are these valid expansions or is it all pie in the sky? By way of demon- 
stration, Newton just offered the following: 


To prove these operations I multiplied 


into itself, & it became 1 — xx, the remaining terms vanishing into infinit 
by the continuation of the series. On the other hand, 


twice multiplied into itself also produced 1 — xx. 


From these examples Newton was able to infer and state (in the June 13, 1676, 
letter) a general theorem. If m/n “‘is integral, or (so to speak) fractional” (what 
we now call a rational number), Newton’s theorem stated that 94 


mf(m m(m m 
ay ae 
n\n 2 n\n n a Re: 


m 
l m/n _ 4 ; 
ee ater age 0 bo 


This is now called the binomial theorem. Newton used it, as well as other series 
expansions, in the development of what has become known as the calculus. 
The matter of which logarithm is the one previously referred to as “Hlog” will 
be taken up in the next section. 


34 It is stated here in modern notation. Newton wrote his equation as 


m—2n m—3 
C 
3n Ot 4n 


m m—-— 


m. m m n n 
PLEO Sie AOt— 804+ DQ+ &e, 


in which “I employ ... A for the firs term, P 7 ; B for the second, 7 AQ, & so on.” Putting 
P =1, Q =x, and rearranging yields the stated result. 

Newton did not state the complete theorem until he wrote this letter of 1676, but he had 
already given the form of the general coefficien (which amounts to the same thing) at the 
end of the 1665 manuscript already quoted at the beginning of this section. Using x/y 
instead of m/n, he wrote this coefficien as 


1xxxx-yxx-2yxx-3yxx-4yxx-S5yxx-6by 


&e. 


Ilxyx 2yx 3y x 4y x Sy x 6y x Ty 
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2.6 THE LOGARITHM ACCORDING TO EULER 


The identificatio of the logarithm temporarily labeled “Hlog” in the preced- 
ing discussion was made by Leonhard Euler (1707-1783) of Basel. The son 


LEONHARD EULER IN 1756 
Portrait by Jakob Emanuel Handmann. 


of a preacher and destined to enter the ministry, his ability in mathematics 
soon convinced his father to let him switch careers, and he went on to be- 
came the most prolifi mathematics writer of all time. In 1727, the year of 
Newton’s death, he was invited to join the newly founded Academy of Saint 
Petersburg, in Russia, and soon began producing first-rat research. It was the 
next year, in a manuscript on the firin of cannon, that he introduced a soon to 
become famous number as follows: “Write for the number whose logarithm is 
unity, e,” but he did not give a reason for this choice of letter.°° By that time, he 


35 In “Meditatio in Experimenta explosione tormentorum nuper instituta,” published 
posthumously in 1862, p. 800. The earliest printed appearance of the number e is in Euler’s 
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had already define the exponential and logarithmic functions, but the math- 
ematical community at large had to wait until Euler was ready to publish. In 
1741 he accepted a position at the Academy of Berlin, where he would remain 
for twenty-fi e years, and in 1744 he wrote his enormously influentia treatise 
Introductio in analysin infinitorum. Published in Lausanne in 1748, it became 


INTRODUCTIO 


IN ANALYIYSIN 


INFINITORUM. 


AUCTORE 


LEONHARDO EULERO, 


Profeffore Regio Benouinensi, €9 Academia Im- 
perialis Scientiarum PutROPOLITANA 
Socio. 


TOMUS PRIMUS. 


LAUSANN 4&4, 
Apud MarcuM-MICHAELEM BousQueET & Socios. 


MDCCXLVIIL 


the standard work on analysis during the second half of the eighteenth century. 
In the firs volume of this treatise Euler considered the definitio of a* (at 
this point he chose z as the symbol for a real variable) a trivial matter “easy 


Mechanica sive motvs scientia analytice exposita, 1, 1736 = Opera omnia, Ser. 2, 1, p. 68. 
For English translations of the relevant passages of these works, see Smith, A source book 
in mathematics, pp. 95—96. 
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to understand from the nature of Exponents” [Art. 101].°6 It is not; while the 
meaning of a” was clear after Newton, what does a¥7 mean? Euler simply 
said “a certain value comprised between the limits a” and a” [Art. 97]. In 
short, he was not too clear about it but seemed to have in mind the existence 
of what we now call a continuous extension y = a‘ of y = a”. Accepting 
this as a fact, then he define its inverse function [Art. 102]: 


In the same manner as, given the number a, it is possible to fin the value of y 
from any value of z, conversely, given any affirmat ve [positive] value of y, 
there is a convenient value of z, such that a* = y; this value of z, regarded 
as a Function of y, is usually called LocariruM of y. 


There is, of course, no such thing as the logarithm of y. What there is, instead, 
is one logarithm of y for each choice of the number a, “which, for this reason 
is called base of the logarithms” [Art. 102]. Then, in our present notation, 
a’ = y if and only if log, y = z. However, logarithms as exponents were 
not new when the Jntroductio was published. They had been introduced by 
Euler in the unpublished manuscript cited at the beginning of this section, but 
the firs systematic exposition of logarithms as exponents had already been 
printed—without knowledge of Euler’s work—in the introduction to William 
Gardiner’s Tables of Logarithms of 1742, “collected wholly from the papers” 
of William Jones. 

The success of the /ntroductio rests on the amount and importance of 
the mathematical discoveries that Euler included in it, making it one of the 
most significan mathematics books of all times. Its readers might have been 
bewildered about the fact that there is one logarithm for each base a [Art. 107], 
but Euler easily showed that all logarithms of y are multiples of each other 
[Art. 108]. Indeed, if z = log, y then y = a* and 

log, y = log, a* = zlog, a = log, ylog,a, 3” 
so that any two logarithmic functions, as we would say today, are constant 
multiples of each other. Thus it appears that we need retain only one logarithm, 
and the question is: what should be its base? 


36 References to the Introductio are by Article number rather than by page. In this way 
the reader can refer to any of the available editions cited in the bibliography. 

37 This is a modernized version. What Euler actually did is to show that if M and N are 
two numbers whose logarithms in base a are m and n, and whose logarithms in base b are 
wand v, then m/n = jt/v. Taking M = a, so that m = 1, gives v = ny, and with N = y 
this is the equation stated above. 
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The answer came from Euler’s work in expanding both the exponential 
function y = a*,a > 1, and the corresponding logarithm in infinit series. To 
do that, “let w be an infinitel small number” [Art. 114]. Then, a° = 1 means 
that a® = 1+ w, where yj is also infinitel small. Write y = kw, where, as 
Euler remarked, “k is a finit number that depends on the value of the base a,” 
and then for any number i [Art. 115], 

oe i,t, , i= yy , @-—I)G—2) 
a Ue i aes 1 k*w eo a 
Euler did not give a reason, but we know this to be true by Newton’s binomial 
theorem if i is an integer or a quotient of integers. Now, for any number z let 
i = z/w. For Euler i was “infinitel large,’ but we may prefer to think of w 
as a very small number chosen so that i is a very large quotient of integers. 
Putting w = z/i, the previous equations become 


: kz\' 1 l@-1)_,,, 1@-DG-2) 3; 
z=(14+—) =1+-k ce fp &e. 
: (1+ 5) ic" te foe 
Insisting on the fact that i is infinitel large, Euler stated that [Art. 116] 
i-1 i-2 
=> ile => 


i i 


ko + &e. 


1, 


and so on, which is approximately true if i is very large. Therefore, 


zy kz kz? 23 k+z4 sh 
t—14—~4 c. 
“ f * [eo 159 [295A 


For z = 1, he obtained the following relationship between a and k: 


eee es e a + &c 
"T° 1-25 1-2-3°1-2-3-4 ° 
and then we can ask the question: for which particular base a isk = 1? Clearly, 
this is true for [Art. 122] 


ee a re 
Me hh ro ak reo on bere & temas 
Euler found the sum of this series to be 2.71828182845904523536028 &c. 
Later [Art. 123] he denoted this number by e—the firs letter of the word 


exponential—and gave the series 


a 
ee 7.) ed (eaes 
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Having expanded the general exponential function in an infinit series, 
Euler turned to the logarithmic function, which he denoted by / regardless of 
its base. To obtain its series expansion, he noted [Art. 118] that the equation 


a® = 1+ko yields w = 1(1 + kw) and, consequently, 
io = il(l+kow) =1(1 + ko)’. 


“It is clear, that the larger the number chosen for i, the more the Power 
(1 + kw)! will exceed unity;” that is, for any x > 0 (Euler switched from z 


to x at this point) we can choose i so that x = (1 + kw)! — 1. Thus, 


l+x=(1+ko)! 


and 
11 +x) =l1 +ko)' = io, 
andi must be infinitel large because w is infinitel small. From the definitio 
of x it follows that [Art. 119] 
i ae | 
ey =] fi _ 
iw a +x) k 


and then we can use Newton’s binomial theorem to obtain 


=). 9-9), 


1 Wi 4 es | 
(1+ x) x+ 7) 1223 
1/1 1 1 
She = bil =2(o—3 
tM i i 4, 
hora x &e. 
1 l@-1), 1G@-NHQi-) ,; 
=1+- x == aecae 
i i+ 2i i-2i+3i 
l@-—NHQi-HGBi-1 , 
&e. 
7 oe dens 


(Euler did not write the firs of these two series, but gave only the simplifie 


form.) Since i is infinitel large, 
tf w=) 2 


ae 
“og ~ 3 5° = es 


Ai 
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from which 


i 48 a7 , “4 &e. 


According to the last equation in Article 19, iw is obtained by dividing this 
result by k and subtracting i/k. Then the equation /(1 + x) = iw, established 
at the end of Article 18, shows that 


L/x xx x3 x4 
li4+x)= + ri + &c. }. 


In the particular case in which a = e and k = 1 this becomes [Art. 123] 


Xx XX x x4 
(tas ->t+y-7t & 


This is the same series obtained by Newton and Gregory, which shows that 
the elusive “Hlog” is actually / with base e. Euler called the values of this / 
“Logarithmi naturales seu hyperbolici” (natural or hyperbolic logarithms) 
[Art. 122]. From now on we shall reserve the letter / for natural logarithms. 


Since every logarithm is a constant times /, the values of any logarithm can 
be computed from this series. So, what about log), 2 and log), 5? Is the use 
of this series faster than evaluating 54 square roots of 10 and then 52 square 
roots of 2, as we did when discussing the work of Briggs? In Chapter VI 
of the Introductio [Art. 106], Euler had used a variant of the square root 
method to fin log; 5, obtaining the value 0.6989700, but then, at the very 
end of Article 106, he mentioned the discovery of “extraordinary inventions, 
from which logarithms can be computed more expeditiously.” This refers to 
the use of the series derived above, which he would obtain in Chapter VII. 
However, it is not possible just to plug in, for instance, x = 4, because then, 
as Euler observed, the terms of this series “continually get larger” [Art. 120]. 
Mathematicians knew at that time—or they felt in their bones, in the absence 
of a theory of convergence—that if an infinit series is to have a finit sum, 
then its terms must decrease to zero, so that x cannot exceed | in this case. 
Euler found his way around this obstacle as follows. Replacing x with —x 
in the equation giving /(1 + x) and subtracting the result from that equation 
[Art. 121] yields 
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l1+x)-ld-x)= . + &c. 
x x2 x3 x4 ) 
&e. 

( 1 2 3 4 


=2(7 +5424 a0) 
a ae a le 


Using the properties of logarithms, this is equivalent to [Art. 123] 


l+x 2x 2x3 2x5 2x7 = 2x? 
I = + &e., 38 
is ( 4 6° 9 © ° 


“which Series converges strongly, if x is replaced by an extremely small frac- 
tion.” This device allows us to compute the logarithms of numbers larger 
than 1 from values of x smaller than one. Thus, for x = 1/5, one has 
(1+ x)/( — x) = 3/2, and Euler obtained (except for the numbers in paren- 
theses) 


- 2 A: 2 2 2 * 2 +& 
= Cc. 
2 j|ep. 3252 S52 “Teh? ~ Ox5? 
(= 0.40546510810816438197801310). 
Next, substituting firs x = 1/7 and then x = 1/9, 
ra 2 a 2 2 2 Z 2 +& 
= c. 
a ey Beg? Be9t edt ~~ Oe 
(= 0.28768207245178092743921901), 
and 
5 2 2 2 2 2 
+ + &c. 


47.9 13.931 5.95 7.971 5.99 
(= 0.22314355131420975576629509). 


38 This formula was found firs by James Gregory in his Exercitationes geometrice, 
as Consectario 4 to Proposition HI [p. 12], in which Gregory denoted by H and 4 the 
abscissas that we denote by | — x and 1 + x and by S and 3 the corresponding ordinates. 
Once again referring to x (which he viewed as the area of a certain parallelogram) as “primus 
terminus,” he stated his result as follows: “spatium Hyperbolicum SH 43 = duplo primi ter- 
mini + $ tertii + z quintii + z septimi + 5 nonii + &c. in infinitum. Variants of the 
same formula were also used in logarithmic computation by Newton in his work on fluxion 
and infinit series of 1670—1671 (see Whiteside, The mathematical papers of Isaac Newton, 
III, 1969, p. 227) and by Edmund Halley in “A most compendious and facile Method for 
Constructing the Logarithms,” 1695, pub. 1697. 
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Then, by the properties of logarithms, 


4 
= 5 + ig = 0.6931471805599453094172321, 


3 + 
= > +212= > + 2(15 + 15) = 1.6094379124341003746007593, 


and 
110 =15 +12 = 2.3025850929940456840179914. 


This value, which Euler gave as stated here, may be compared to that obtained 
by Gregory at the end of Section 2.4. Finally [Art. 124], Euler returned to the 
computation of common logarithms and showed that if the hyperbolic loga- 
rithms are divided by the hyperbolic logarithm of 10, the common logarithms 
will be obtained. He did not give an example, but we shall show one, while 
simplifying his explanation (and reducing his accuracy) in the process. Using 
the already established equation log, y = log, y log, a with b = e, y = 2, 
and a = 10, we obtain 


(2) _ 0.693147180559945309 
1(10) —_2.302585092994045684 


log) 2 = = 0.301029995663981195. 


This is approximately the same value provided by the square root method, and 
it was in this manner that the natural logarithm became indispensable. 

Now that the logarithm is a function, we can ask whether it has a derivative 
and how to fin it. The series found above allowed Euler to fin it as follows 
in his book on differential calculus [Article 180]:°9 


We put x + dx in place of x, so that y is transformed into y + dy; whereby 
we have 


d 
ytdy=I(x+dx) & dy =Iex dx tay =1(1 +S), 
x 


As above the hyperbolic logarithm of this kind of expression | + z can be 
expressed by an infinit series, as 


(4 y= - t &e. 


39 Institutiones calculi differentialis, 1755. This quotation is from Blanton’s translation. 
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; . dx . 
Therefore if we substitute — for z, we obtain: 
x 


dx dx* dx? 
dy = &e. 
a) Ue 3 © 
Since all the terms of the series vanish in front of [meaning: compared to] 
the firs term, it will be 


dx 
d.lx =dy = —-: 
X 
In short, the derivative of /x is 1/x. Of all the logarithms this is the only one 
with such a neat derivative. For if 


log, x = log, x log, e = Ix log, e, 


then it is clear that 
log, x = — log, e 
x 


is not quite as good-looking as the derivative of /x. We conclude that the 
logarithm to keep is the natural logarithm, which is a very appropriate name 
for it. 

So, what about the logarithms of negative numbers? In 1712 and 1713 
a dispute over this had flare up in the correspondence between Gottfried 
Wilhelm Leibniz (1646-1716) and Johann Bernoulli (1667-1748). Leibniz 
had published an article4° expressing his opinion on the logarithm of —1 
[p. 167]: 


Indeed it is not positive, for such numbers are the Logarithms of positive 
numbers larger than unity. And yet it is not negative; because such numbers 
are the Logarithms of positive numbers smaller than unity. Therefore the 
Logarithm of —1 itself, which is not positive, nor negative, is left out as not 
true but imaginary. 


An unfortunate choice of word (imaginarius), because it has a clear meaning 
in today’s mathematics. What Leibniz meant is that /(—1) does not exist. 
Bernoulli disagreed, stating his opinion that 


Ix =1(—x)*" 


40 “Observatio, quod rationes sive proportiones non habeant locum circa quantitates nihilo 
minores, & de vero sensu methodi infinitesimalis” 1712. 

411 have inserted parentheses where he had none, and will continue this practice to the 
end of this chapter for the benefi of the modern reader. However, the original notation of 
Euler, who did not use parentheses around expressions such as —x and +x, is kept in the 
quotations. 
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in a letter to Leibniz of May 25, 1712. He gave four reasons for it, the firs 
being that forx > Othe differentials ofboth /x and/(—x) are identical, and then 
the stated equation follows [pp. 886—887]. This was not accepted by Leibniz 
in his reply of June 30 because he believed that differentiating /x is legitimate 
for x > 0 only [p. 888]. Euler had also exchanged some correspondence 
with Bernoulli on this subject from 1727 to 1731, but, although he remained 
unconvinced by Bernoulli’s arguments, he had no alternative theory of his own 
to propose at that time. 

The controversy between Leibniz and Bernoulli resurfaced in 1745 when 
their correspondence was firs published. By this time Euler had found the 
solution: 1(—a) = la + 2(1 + 2n)./—1, where n is any positive integer, and 
presented it in a letter of December 29, 1746, to the French mathematician 
Jean le Rond d’Alembert (1717—1783).42 But d’ Alembert responded: 


However, although your reasons are very formidable and very learned, I 
admit, Sir, that Iam not yet completely convinced, because ... 


at which point he stated three of his own reasons for this lack of conviction, but 
we know now that they are invalid.** In fact, d’Alembert sided with Bernoulli, 
convinced by Bernoulli’s fourth stated reason: that (—x)* = x? implies that 
21(—x) = 21x and, once more, 


l(—x) = I(+x). 


Euler replied on April 15, 1747, but d’Alembert remained unconvinced and 
Euler eventually gave up the argument. Instead, on August of that year he 
sent an article, Sur les logarithmes des nombres négatifs et imaginaires (On 
the logarithms of negative and imaginary numbers), to the Academy of Berlin 
to give his solution; but he may have withdrawn it later because it was not 
published until 1862! 

However, he presented another complete solution in 1749.44 With both 
correspondents now dead, Euler felt free to express his own opinion. First, he 
explained how Bernoulli’s firs reason was wrong because it was [p. 144; 200] 


42 Tn a previous letter of September 24, Euler had communicated the value of /(—1) to 
Gabriel Cramer as (7 + 2mm) - /—I1. See Euler, Opera omnia, Ser. 4a, 1, Birkhauser 
Verlag, Basel, 1975, R. 469, pp. 93-94. 

43 Euler’s solution and d’ Alembert’s reply can be seen in Euler, Opera omnia, Ser. 4a, 5, 
Birkhauser Verlag, Basel, 1980, pp. 251—253 and 256-259. 

44 “Te la controverse entre Messrs. Leibnitz et Bernoulli sur les logarithmes négatifs et 
imaginaires,” 1749, pub. 1751. Page references are to the original paper firs and, after a 
semicolon, to the Opera omnia. Quotations are from the original paper. 


146 LoGaRITHMS CHAPTER 2 


clear, that since the differential of /—x & of /-++x is the same a , the quantities 


—x and /-+x differ from one another by a constant quantity, which is equally 
evident, in view [of the fact] that /—x =/—1+1/+ x. 


Then he pronounced himself against Leibniz’ belief that the logarithm of —1 
does not exist [p. 154; 208]: 


Because, if /—1 were imaginary, its double, i.e. the logarithm of (—1)? = +1, 
would be too, which does not agree with the firs principle of the doctrine of 
logarithms, according to which it is assumed that /+1 = 0. 


However, Euler had a greater difficult disposing of Bernoulli’s fourth 
reason, because it rests on the belief that for any power p” it is true that 
I(p") = nlp. But accepting this leads to contradiction [p. 147; 202]:4° 


Because it is certain that (a,/ — 1)* = a‘, thus we’ll also have /(a./ — 1)4 = 
la*, & furthermore 4 1(a,/ — 1) = 41a, consequently I(a./ — 1) = la. 


Putting a = 1 leads to /,/—1 = 0, and this is impossible because Euler was 
already aware of the fact that (this will be explained in Section 3.5) 

1 l/-1 

eine eae 
2 V-1 
But rejecting Bernoulli’s belief that /(—1) = /(+1) = 0 also leads to contra- 
diction [p. 148; 203]: 


To make this more evident, let /—1 = w, & if isn’t w = 0, its double 2m will 
not be = 0 either, but 2m is the logarithm of the square of —1, and this being 
+1, the logarithm of +1 will no longer be = 0, which is a new contradiction. 


These contradictions cannot be allowed to stand or the “enemies of Mathe- 
matics,” as Euler called them [p. 154; 209], would have a fiel day. Not to fear. 
He found the source of these contradictions in a very insidious assumption that 
mathematicians, including himself, implicitly made [pp. 155—156; 210]: 


it is that one ordinarily assumes, almost without noticing, that each number 
has a unique logarithm ... Therefore I say, to make all these difficultie 

& contradictions disappear, that just as a consequence of the given defini 

tion each number has an infinitud of logarithms; which I will prove in the 
following theorem. 


45 The square root of —1 is shown here in Euler’s original notation. 
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The theorem just restated that each number has an infinitud of logarithms. 
And how would he fin the additional logarithms? By widening the search. 
Now that ./—I has become involved, it is not just the logarithms of negative 
numbers that we must seek, but those of complex numbers as well. 


COMPLEX NUMBERS 


3.1 THE DEPRESSED CUBIC 


In his Artis magne, sive de regvlis algebraicis (a title usually shortened to Ars 
magna) of 1545, Girolamo Cardano (1501-1576) set out to split the number 
10 into the sum of two numbers whose product is 40. Thus, «(10 — x) = 40, 
which has the solutions 5 + ./—15 and 5 — /—15. And indeed, Cardano 
invited his readers to consider that the product 


5+ /-15 times 5 — /—15, disregarding the cross products, makes 25 — 
(—15), whichis + 15, therefore this product is 40.1 


This was the firs time that the square root of a negative number was used 
in computation. However, this was not a motivation for the introduction of 
complex numbers. The given equation is a quadratic, and the fact that it has 
no real solutions is, and was then, a commonplace. Mathematicians of the 


| The original passage in Cardano’s own notation reads: “5 p: R m:15 in 5 m: Rm: 15, 
dismissis incruciationibus, fi 25 m:m: 15, quod est p: 15, igitur hoc productum est 40,” 
and the Latin words translated above as “disregarding the cross products” are “dismissis 
incruciationibus.” The translation by Witmer has a different interpretation of these words. 
Since the Latin cruciare means to torture, this passage is rendered as: “Putting aside the 
mental tortures involved, multiply 5 + ./—15 by 5 — ./—15 making 25 — (—15) which is 
+15. Hence this product is 40.” See p. 219 of the Dover Publications edition. Witmer’s 
translation of this problem and solution is reproduced in Calinger, Classics of Mathematics, 
1995, p. 265. For another English translation plus a photographic reproduction of the Latin 
original, see Struik, A source book in mathematics, 1200-1800, 1969, pp. 67—69. 


E.A. Gonzalez-Velasco, Journey through Mathematics: Creative Episodes in Its History, 148 
DOI 10.1007/978-0-387-92154-9_3, © Springer Science+Business Media, LLC 2011 
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MATIC! PHILOSOPHY AC MEDIC, 


ARTIS MAGNA, 


SIVE DE REGVLIS ALGESRAICIS, 
Lib.unus. Qui & cotius operis de Arithmetica, 
OPVS PERFECTVM 
infcripfit,eftin ordine Decimus, 


itentntaeta Algebraicas (Tali, dela Cof 
fa uocant) nouis adinuentionibus,ac rationibus ab Authore ita 
1pletatas,ut pro pauculis ancea wulge tritis.iam feptuaginta euaferint. Nex 

Sale , ubiunus numerus alteri,aut dito uni,uerim etiam,ubi duo duobus, 
a tres uni ¢quales fuerint,nodumexplicant. — Huncaétlibrumideo feore 


Title page of the Nuremberg edition of Cardano’s Ars magna 
From Smith, Portraits of Eminent Mathematicians, Il, 1938. 


sixteenth century would have been content with branding the new equation 
“impossible” or with simply saying that it has no solution. 


In the sixteenth century quadratic equations were old hat. Abraham bar 
Hiyya Ha-Nasi (1070-1136), of Barcelona, one of the twelfth-century trans- 
lators, better known as Savasorda, made the complete solution known in his 
book Hibbur ha-Meshihah ve-ha-Tishboret (Treatise on Measurement and 
Calculation), the earliest algebra written in Europe, later translated into Latin 
by Plato of Tivoli as Liber enbadorum in 1145. But the cubic equation was 
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a hot topic and everybody was trying to fin a method of solution. Umar 
ibn Ibrahim al-Khayyami (1048-1131) had already been able to solve cubics 
geometrically by the method of intersecting conics, but the Europeans were 
interested now in a purely algebraic solution. 

By the end of the fifteent century no one had succeeded, but sometime in 
the second decade of the new century, a professor of arithmetic and geometry 
at the University of Bologna, Scipione di Floriano di Geri Dal Ferro (1465— 
1526), learned how to solve the particular case x? + px = q, where p and q 
are positive numbers. This equation is called the “depressed cubic” because 
it lacks the term in x7. 

Dal Ferro never published his solution and kept his knowledge to himself. 
Mathematics was then as magic is today: you kept your secrets to yourself 
and impressed others with the results, in this case with your ability to solve 
equations. Furthermore, mathematicians could make some extra money— 
and enhance their necessary prestige to keep their nontenured positions—by 
challenging competitors to public contests in which they posed each other 
problems. However, shortly before his death Dal Ferro confide his secret to 
one of his students, Antonio Maria Fior of Venice. Much later, Fior—a poor 
mathematician himself—had the audacity to challenge none other than Nicold 
Fontana (c. 1499-1577) of Brescia to a cubic-solving contest, to take place on 
the 22nd of February, 1535. But Fior was not as good at keeping his secret 
as Dal Ferro had been, and there was a circulating rumor that he knew how to 
solve x? + px = q. 

Fontana, usually known as Tartaglia, “the stammerer”’—as a consequence 
of an injury inflicte by a French soldier when he was a child—was one 
of the great intellects of his time. Born in poverty, he was self-educated in 
mathematics, and in this very same year, 1535, he had discovered how to solve 
another type of cubic, x? + px* = q. Now he applied himself to the task of 
rediscovering the solution of the depressed cubic, and succeeded eight days 
before the contest. Each contestant then submitted thirty cubics to the other, 
to be solved within the next fift days. Actually, it was no contest. Tartaglia 
solved all of Fior’s depressed cubics in just two hours, but Fior could never 
solve any of Tartaglia’s, who had chosen the type x* + px” = q. 


3.2 CARDANO’S CONTRIBUTION 


As a youngster Cardano persuaded his father to let him enter the University 
of Pavia—his hometown—to study medicine, earning his doctorate in this 
discipline in 1525 at the University of Padua, but when denied membership in 


SECTION 3.2 CARDANO’S CONTRIBUTION 151 


QVESITILETINVENTIONID, | 
VERSE DE NICOLO TARTALEA 
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Frontispiece of the princeps edition of the Qvesiti, Venice, 1546. 
Reproduced from the virtual exhibition E/ legado de las matematicas: 
de Euclides a Newton, los genios a través de sus libros, Sevilla, 2000. 


the College of Physicians in Milan—due to the fact that he was an illegitimate 
son—he had difficult making ends meet, and by 1533 things had gotten so 
bad that he had to pawn his wife’s jewelry. Happily enough, he was also 
proficien in mathematics, astrology, and gambling. In this last discipline he 
wrote the book Liber de lvdo aleae (Book on games of chance), probably 
about 1563, and with this expertise plus no aversion to cheating he won more 
than he lost. He also came out ahead in astrology, for, although imprisoned 
in 1570 for casting the horoscope of Jesus, he was later hired as astrologer 
to the Pope. Eventually, he became a successful physician all over Europe, 
with a reputation for achieving miraculous cures. Earlier in his life he had 
become lecturer in mathematics at the Piatti Foundation in Milan, a post that 
he resigned in 1540. 
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GIROLAMO CARDANO 
From Smith, Portraits of Eminent Mathematicians, Il. 


It was during his tenure at the Piatti that Cardano heard of the cubic-solving 
contest, and he wasted no time in trying to convince Tartaglia to reveal his 
secret. He succeeded in 1539, but only under the promise of secrecy, and 
he received the solution without proof. But Cardano was able to supply his 
own, and later realized that Tartaglia’s solution was the same as the one he 
eventually found in Dal Ferro’s surviving papers. Somehow this findin made 
Cardano feel free to publish it in the Ars magna, while giving proper credit to 
both Tartaglia and Dal Ferro. He started Chapter XI with these words: 


Scipio dal Ferro of Bologna about thirty years ago invented the rule in this 
chapter, and passed it on to Antonio Maria Florido of Venice, who when 
engaged in a contest with Niccolo Tartaglia of Brescia, gave Niccol6 occasion 


2 Alternative translations can be found in Cardano, Ars magna; Witmer, Ars magna or 
the rules of algebra, 1993, p. 96; reproduced in Calinger, Classics of Mathematics, p. 263; 
Smith, A source book in mathematics, 1929; reprinted by Dover Publications, 1959, p. 204; 
and Struik, A source book in mathematics, 1200-1800, p. 63. 
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to discover it; and he gave it to us because of our requests, but suppressed the 
demonstration. Armed with this help, we sought its demonstration in various 
forms, which was very difficult and write out as follows. 


Of course that stole the thunder from Tartaglia, who wanted to be the firs to 
eventually publish it. He claimed later, in his Qvesiti et inventioni diverse of 
1546, that he had given his proof to Cardano. In any event, let us proceed to 
Cardano’s demonstration, shown as firs published in the next two pages. 

But before we embark on the details of the proof it is necessary to explain 
the difference between Cardano’s writing, which would be very difficul to 
follow today, and our current style. We have written the depressed cubic as 
x> + px = q instead of Cardano’s De cubo & rebus equalibus numero (on 
the cube and the thing equal to the number). In his time there was no habit 
of denoting either the unknown or the coefficient by letters or of obtaining 
general formulas. Instead, they taught and learned by example and their 
descriptions were mostly verbal. 

Cardano started his proof by example under the heading DEMonstRATIO. 
Referring to his own figure shown on the next page, he stated the equation 
to be solved as follows: “Then let for example the cube of GH & six times 
the side GH equal 20,” which can be written as x°7 + 6x = 20 if we use the 
letter x instead of GH.° Then, without interruption, he stated a proposition 
that is the most important step in solving the equation. It can be translated as 
follows and will be explained immediately following the translation: 


& set two cubes AE & CL [meaning cubes built on square bases with diago- 
nals AE and CL], whose difference be 20, such that the product of side AC, 
and side CK, be 2, the third part of the number of things [“thing” meaning 
the unknown GH], & cutting off CB, equal to CK, I say that, if this were 
to be, the remaining line AB, is equal to GH, & is therefore the value of the 
thing, for GH has already been assumed, that it was [the thing]. 


Since the second term in the equation is 6x, there are six “things” and then 
“the number of things” is 6. The rest of the proposition simply states that 
if two segments AC and CK are chosen such that AC? — CK? = 20 and 


3 Semiverbal ways of writing equations were already in use, although there were no 
universally accepted norms. For instance, in folio 30 r. (see the reproduction on page 155), 
Cardano wrote this equation as “cub? p: 6 reb? z@lis 20,” which reads “cubus plus six rebus 
equals 20.” The Latin word res, meaning “thing,” or a variation of it was frequently used 
for the unknown. It was then translated into Italian as /a cosa, and later algebraists became 
known as “cossists.” 


154 Complex NUMBERS CHAPTER 3 


Hisronymt CAarDANt 
-elincritur prima 6 m:rz 30$,hz autem quantitates proportionales 
Fine, cquadtranum fecundz eft aquale duplo producti fecunds in 
primam,cum quadruplo primzx,ut proponebatur, 


De cubo & rebus xqualibus numero, Cap, XI. 


Cipio Ferreus Bononienfis iam annis ab hine triginta fer 
mé capitulun hoc inuenit , tradidit uero Anthonio Mas 
ail ri Florido Veneto, qui cil in certamen cit Nicolao Tare 

yee talea Brixcllenfe aliquando uenifler, occafionem dedit, ut 
Nicolaus inuenerit & ipfe,qui cum nobis rogantibus tradidiffer, fi up 
prefla demonttratione, freti hoc auxiho, demonftrarionem guzcliui« 
mus,eamg in modos, quod difficilmum fuit, redactam fic fubiecis 
mus. DeMONSTRATIO. 

Sicigitur exempl: caufa cubus ¢  & fexcuplum lateris ¢ # aqua 
le 20, ponam duos cubos a & X ¢ 1,quorum differentia fit 20, ita 
Guod productum a ¢ laterss, inc x latus, 
fit 2, tertia fcilicee numeri rerum pars , & 
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ita fucrir,lineam aB refiduum , effe equas 
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p Fcubumae,per D A triplum ¢ Bin quadratum A B,per D & triplum 
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€ numcrus rerum, igitur cx AB yp triplum ACinc K flunté res AB, 
feu fexcuplum 4 B,quare triplum producti ex AB, BC,A C, elt fexcue 
plum a 8,at uero differentia cub1 ac, acubo cK , & exiftenti a abo 
Bc c1xGleex fuppofito,elt 20,& ex {uppofito primo 6’ capituh: , eft 
aggregatum corporum D A,D E,D F,triaigicur hxc corpora funt 20, 
pofira ucro 8 c m:cubus A B,aquahs eft cubo a c,& triplo a cinqua 
dratum ¢ BS cubo 8 c m:& criplo #¢ in quadratum 4c m: per des 
monftrata ilhc, differentia autem triph 8 c in quadratum ac, a triplo 
acin quadrarum s c eft productum a B,B C,A c,quare cum hoc,ut de 
monftracum eft,cquale fit fexcuplo a B, igitur addhro fexcuplo az, 
ad id quod firex ac in quadratum pc ter, fict triplum cin quadrae 
fusn A ¢,cum igitur g ¢ fit m:iam oftenfum eft,quod productum cp 
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A BC 


CARDANO’S SOLUTION OF THE CUBIC 
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De AritunericaA Lik x. 30 


in quadratum A c ter,eft m:& reliquum quod c: aquatur eft prigitur 
triplum c 8 in Gdratum a 8,& triplum a c in Gdracu c 6, & fexcuplit 
asnihil faciunt. Tanta igicur eft differentia,ex cOmuni animi {enter 
tia,ipfius cubi a c,d cubo 8 c, quantum eft quod cSflatur ex cubo Ac, 
& triplo a cin quadratum c 8,& triploc Bin quadracum acm:X cu 
bo pn cm:& fexcuplo a 8,hoc igictur eft 20, quia differentia cubi a c,a 
cubo c 8, fuit 20,quare per fecundum fuppofitum 6" capituli , pofira 
Bcm:cubus AB aquabitur cubo ac, & triploac in quadraium Bc, 
& cubos c m:& triplo 8 cin quadratum a ¢ m:cubus igitur 4 B,cum 
fexcuplo a B,per communcm animifententiam , cum sxquecur cubo 
ac&triplo ac in quadratum c B, Xtriplo cB in quadracum a B an; 
& cubo c m:& fexcuplo as, quwiam wquatur 20, uc probatum 
eft,aequabuntur etiam 20,cum igitur cubus ap & fexcupluin as a 
quentur 20,& cubus c »,cum fexcuplo ¢ nequentur 20,crit ex com 
muni animifententia,& ex dictis,in 3° p'& 3 1° undecimi clemenroz 
rum,c Haqualis aB,igitur cueltdiffcrentia ac& ce, funtautem 
ac&c B,uclac&ck,numeri fculinia continences fuperficiem , xs 
qualem tertix parti numeri rcrum,quarum cubi differtnt in numero 
xquationis,quare habebimus regulam. 
REGvLA. 

Deducito tertiam partem numeri rerum ad cubum , cui addes 
quadratum dimidi numeri xquationis,& rotius accipe radicem, {cili 
cet quadratam,quam feminabis,unigs dimidium numeri quod iam 
in {c duxeras,adrjcies,ab altcra dimidium idem minucs,habebisq Bi 
nomium cum fua Apotome, inde detracta mR cubica Apotome cx re 
cubica fui Binomij,refiduit quod ex hoc relinquitur,eft rei ¢ftimatio. 
Exemplum.cubus & 6 pofitiones, aquans 


° yas Immlics 
tur 20,ducito 2 , tertiam partem 6 ,ad cus cub? p:6 reb’ aqlis 20 


bum, fit &,duc 10 dimidium numeri in fe, = 29 
fit 100,iunge 100 & &,fic 108, accipe radi- 8 — 10 
cem qua elt R: 108, & eam geminabis, alte 108 
riaddes 10,dimidium numeri,ab altero mi rz 108 p10 
Rz2108m:10 


nues tanrundem,habebis Binomit' R 108 
p:10,& Aporomen R: 108 m:10, horum 
accipe Re cub” & minuc illam que eft Apo 
toma, ab ea quz eft Binomi, habebis rci aftimationcm, F v: cub: 12 

108 p:80 mk V: cubica Fz 108 m: 50. 
Aliud,cubus p: 3 rebus aquetur 10,duc 1,tertiam partem 3, ad 
cubumm, fit 1 ,duc 5,dimidium 10,ad quadratuim, fic 25,iunge 25 a, 
H 2 hunt 


REV: cu.R 108 piso 
MIR Vici. 108 mito 


From Smith, History of Mathematics, IL, 1925, pp. 462—463. 
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AC x CK = 2, then GH = AB. Thus x = AB, which will later make solving 
the equation a simpler matter. 

For the sake of generality and simplicity, we shall make some changes 
in Cardano’s presentation from this point on. First, we shall keep p and gq 
instead of 6 and 20. Then if, as shown in the next figure we write u = AC 


u 


and v = BC = CK, it is clear that AB = u — v, and the proposition can be 
restated as follows: if u and v are positive numbers such that 


uu—v =q and uv = 


then wu —v = x. Thus we see that if the two preceding equations can be solved 
for u and v in terms of p and gq, then x can found as the difference of u and v. 

Returning for a moment to Cardano’s original statement, note its geomet- 
ric fl vor. The unknown GH is the side of a square, and the demonstration 
will be purely geometric. Algebra as we know it was in its infancy, but the 
accomplishments of Greek geometry were fresh in the minds of mathemati- 
cians of the Renaissance, and it is to geometric methods that they turned for 
inspiration and geometric results that they tried to emulate. 

However, while keeping the geometric character of the demonstration, 
we shall rearrange Cardano’s own argument to shorten it and to avoid using a 
proposition from his earlier Chapter VI. Then, using a bit of three-dimensional 
imagination, think ofa cube of height u erected on the square with diagonal AE 
shown in the previous figur (we can think of the cube’s height in a direction 
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perpendicular to the page). Its volume, w°, is equal to the sum of the volumes 


of the eight solid bodies listed in the next table. 


Solid Base diagonal Height Volume 

Cube FD u-—v  (u—v) 

Cube DC v v3 
Parallelepiped AD u—v (u—v)*v 
Parallelepiped DE u—v (u—v)*v 
Parallelepiped FD v (u— v)?v 
Parallelepiped AD v (u — v)v? 
Parallelepiped DE v (u — v)v* 
Parallelepiped DC u—v w(u-—v) 


Then 
w=(u—vptv4+3(u—v)*v + 3(u — v)v* 
=v4+(u—v)P+ (u—v)[3@— v)v + 307] 
=v4+(u—v)> + 3uv(u — v). 


Therefore, (uw — v)? + 3uv(u — v) = uw? — v or, recalling the definitio of u 
and v, 
(u—v) + pv) =4. 


Comparing this with the given cubic x7 + px = q shows that u—v isa solution. 
This is the end of the demonstration proper, which Cardano expressed with 
the words: “therefore GH is the difference of AC and CB” (igitur GH est 
differentia AC & CB).4 


What remains now is to solve the equations 
w—vw=q and ww == 


for u and v. But Cardano skipped this step, simply saying “whence we have 
the rule” (quare habebimus regulam) and proceeding to give the solution in 


4 Algebra may have been in its infancy in Cardano’s time, but it has been grown up for 
some time now. A nongeometric demonstration is very simple: expanding (u — v)? as 
uw —3u*v +3uv* — v° and rewriting the right-hand side as u? — 3uv(u — v) — v provides 
at once the equation that took a number of solid bodies to assemble in the sixteenth century. 
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the general case. Today we can see that adding the square of the firs equation 
above to four times the cube of the second and simplifying gives 
4p? 
3,32 2, *P 
(ue tu) =q t+ a 
Finding now the square root of both sides and solving the resulting equation 


simultaneously with u* — v? = q yields 
2 3 2 3 
3 q P q 3 q Pp q 
a a ae! d = ee ae 
. A gy oe 4°37 2 


Finally, extracting cube roots and putting x = u — v gives Dal Ferro’s formula: 


ie op oe. Sg og 
x= )J—+—4 yt 
4° 39" 3 4 3 


usually known today as Cardano’s formula.* Clearly, this formula gives a 
real solution of the equation, in fact the only real solution, because, under 
Cardano’s assumption that p is positive, the derivative of x3 + px —q is positive 
everywhere. Application of this formula to Cardano’s example x* + 6x = 20 


yields the solution 
x= ¥V108+4+ 10— al Vi08 —10, 


although in an inconvenient form, since it would take a little time to figur out 
that this simplifie to x = 2.° 


5 Cardano’s actual rule, stated under the word Recuta. on page 155, is: “Cube the third 
part of the number of things, to which you add the square of half the number of the equation, 
& take the root of the whole, of course the square [root], which you will duplicate, to one 
of them add half of the number which you just squared, from the other subtract the same 
half, you will have a Binomium and its Apotome [these terms are from Book X of Euclid’s 
Elements], then subtract the cube root of the Apotome from the cube root of its Binomium, 
the remainder that is left from this, is the value of the thing.” 

8 This formula can be seen in Cardano’s notation in the last two lines inside the box on 
page 155. Cardano used the letter R with its tail crossed for root (radix), and the V: (the 
initial of vniversalis) next to it means that it is to be applied to the entire expression that 
follows, while “cu.” indicates that it is a cube root. This cube root applies to the sum of 
the square root of 108 (only of 108, since there is no V:) plus 10, using the symbol p: for 
plus. This explains the firs of these two lines, and the second has a similar explanation 
but replacing p: with m: for minus. In this explanation we have followed Smith, History of 
Mathematics, Il, p. 416. Actually, the location of the V: in Cardano’s writing is confusing, 
at least for English-language speakers, since it is easy to interpret the two symbols before 
the number 108 as “cube root,” while in Latin the proper order is radix vniversalis cubica. 
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Cardano’s work goes beyond that of his predecessors in that he was able 
to solve the general cubic, not just the depressed case. But Cardano is rather 
long-winded on this subject. The fact that negative numbers were not accept- 
able motivated him to divide this case into many subcases, all with positive 
coefficients which he studied in Chapters XVII to XXIII, stating a number of 
rules to follow in each subcase and providing individual geometric demonstra- 
tions [Ars magna or the rules of algebra, pp. 121-154]. However, from our 
point of view all this work boils down to making the substitution x = y — b/3 
in the general cubic x7 + bx? +x +d = 0, which reduces it to the depressed 
cubic y* + py = q with 

b? be 2b 
p=c-—> and q=>- os 
3 
This provides the complete solution of the cubic. 

Cardano must also be credited with having noticed that cubics have three 
solutions and that their sum is —b, the opposite of the coefficien of x”. Af- 
ter giving three examples numbered 5 to 7 in Chapter XVIII, he made the 
following statement: ’ 


From this it is evident that the number of x?’s,® in the three examples in which 

there are three solutions for x, is always the sum of the three solutions: as in 

the fift example, where 2 + J/2, 2, and 2 — /2 make up 6, the number of 
x2’s9... Hence, knowing two such solutions, the third always emerges. 

But before we can pronounce the cubic solved let us admit that there is a 
hole in the bucket. We have not considered the case in which p < 0, and we 
do not know whether the stated formula is valid in that case. In Chapter XII 
Cardano studied the cubic x7 = px +q with p,q > 0, and found the solution 
to be much like that for x7 + px = q except for replacing the + sign under 
each square root with a — sign. This is equivalent to saying that the solution 
stated above is also valid for p < 0. But then, what happens if 


3 
+? <0 


ane 
4 27 


7 With minor modifications this is Witmer’s translation in Ars magna or the rules of 
algebra, p. 134. 

8 In Cardano’s arrangement the x? term is on the right-hand side of the equal sign, so 
there is no need for a negative sign. He would have been horrifie by such a need. 

9 Tn Cardano’s own words and notation, “ ... uelut in quinto exemplo, 2 p: 2, & 2, & 
2 m: &2, componunt 6, numerum quadratorum,” f. 39”. 
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In such a case, Cardano’s formula was meaningless in his time, and there 
was no way to proceed. Tartaglia limited himself to calling such cubics “irre- 
ducible,” but it is too bad to think of them in these terms because it is easily 
shown that all depressed cubics with p < 0 that have three real solutions are 
irreducible. Indeed, a straightforward application of calculus shows that if 
p < 0, then the expression x* + px — q has the maximum value 


3/2 
u =2(-2) —q for x=- -> 


But it should be clear from the figur below that if x* + px — q has three real 


y=xi+ px—q 


roots, then M > 0. Thus, 


3/2 2 3 
7 <2(- 2) or Tt <0. 


Is Cardano’s formula powerless to compute one real solution of a cubic that 
has three? 


3.3. THE BIRTH OF COMPLEX NUMBERS 


Rafael Bombelli (c. 1526-1572), a hydrologist from Bologna who made his 
fame in mathematics, pursued this matter further, demonstrating that the for- 
mula of Scipione Dal Ferro is valid in every case. He did this in his Opera su 
l’algebra, written about 1550 in two parts. Only the purely algebraic part (the 
second part is based on geometric constructions), in three books, was printed 


SECTION 3.3 THE BIRTH OF COMPLEX NUMBERS 161 


| as CT RK ox 
LALGEBRA 
PAR TB MAGGIORE 
DELLARIMETICA 


DI RAFAEL BOMBELLO 
BOLOGNESs, 


First page of Book II of the 1579 edition of Bombelli’s L’Algebra. 
Except for the title page and the dedication, it is identical to the 1572 edition. 
Reproduced from the virtual exhibition El legado de las matemdticas: 
de Euclides a Newton, los genios a través de sus libros, Sevilla. 


for the firs time in 1572, and then again in 1579, under the title L’Algebra. 
Opera di Rafael Bombelli da Bologna, divisa in tre libri. 

As an example of the casus irreducibilis, Bombelli considered the cubic 
x? = 15x +4, which has the positive solution x = 4, found by inspection. 
Bombelli used Cardano’s rule to solve this cubic in the 1550 manuscript, as 
shown here. The equation itself is written in the center of the second line as 


3 tL L 
lal5p.4. 


Here the symbol ~ represents the variable, the superscript on it is the power 
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Reproduced from the preface to the 1966 edition of L’Algebra. 


to which it must be raised, and the number under ~ is the coefficien of that 
power (this is the origin of the exponent notation that we presently use). The 
letter a stands for the equal sign (either as the initial of the Latin equalis or 
as a contraction of the Italian eguale a) and p. is for plus. He gave Cardano’s 
solution of the cubic on the eighth line, to the right of the fancy summary 
box and the word fara. He followed Cardano in using the letter R with its tail 
crossed to indicate a root, and there are four of these in the formula. Those with 
a3 on top are cube roots and the other two are square roots. The underboxing 
indicates the range of each root. With p meaning plus and m meaning minus, 
this solution can be rewritten using current symbols as 


#24. J0— 121 + 2 — YO— 121. 


This is, indeed, the solution provided by Cardano’s rule, because rewriting 
the cubic as x? — 15x = 4, we have p = —15 (adopting modern usage of 
negative numbers, inconceivable to Cardano) and g = 4. Then, g7/4 = 4 and 
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p> /27 = —125, and the solution provided by Cardano’s rule is 


FSV e/4S 58 = V4 185 = 2, 


which is equivalent to that stated by Bombelli. 

Cardano might have stopped right at this point, but Bombelli stated that 
the sum of his two cube roots is 4 and “that is the value of the Thing,”'° the 
value found by inspection. This may be a surprising statement in view of 
those square roots of negative numbers, so it would be of interest to reproduce 
Bombelli’s reasoning in arriving at such a conclusion. But before we do that 
it is convenient to prepare the ground, as he did well before this point. On 
page 169 of the 1572 edition he explained how to deal with square roots of 
negative numbers, such as those that arise in the irreducible case. To avoid 
using negative coefficients sucha cubic and Cardano’s formula for its solution 
(the one from his Chapter XII) can be rewritten as x = px +q with p > 0 


and 
las jg pala je PP 
x= + 
2 4 27 2 4 27 


Of course, Bombelli wrote this solution in narrative form at the start of the 
Capitolo di Cubo eguale a Tanti e numero (Chapter of the Cube equal to the 
Unknowns and number) [p. 222]:" 


Wanting to equate the cube to the Unknowns and number [that is, considering 
the equation x? = px + q] take one-third of the Unknowns and cube it and 
the result is subtracted from the square of half the number, and of what 
remains the square root is taken which is added to and subtracted from half 
the number, and of the sum and difference the cube root of each of these is 
taken, and these two roots together are the value of the Unknown (as will be 
seen in the examples below). 


The following is a translation of Bombelli’s groundbreaking proposal 
[p. 133], but some words remain in Italian because they need additional ex- 
planation. It is provided below. 


10 At the bottom of the summary box we read Somma 4 : et tanto uale la Cosa. In 1550 
Bombelli still used the word cosa for the unknown, but in preparing the manuscript for 
publication he introduced a number of changes, and one of them was to discard this term. 
In print he used Tanto (so much) instead of Cosa for the unknown, giving the reason for it 
at the start of Book II, as shown on page 161: “Tanto is a word appropriate to the quantity 
of numbers,” while Cosa “is common to every substance whether known or unknown.” 

11 Page references are to the 1966 edition. 
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I have found some other kind of extended cube root [cube root of a sum] that 
is very different from all the others, which has its origin in the chapter of the 
cube equal to the unknowns and number, when the cube of one-third of the 
unknowns [the cube of p/3] is greater than the square of half the number 
[when p?/27 > q?/4], ... the excess cannot be called either plus or minus, 
but I shall call it pia di meno when it must be added, and when it must be 
subtracted I shall call it men di meno... 


Actually, he did not mean the excess. He meant the square root of the excess, 
that is, any of the two square roots in the previous equation. In fact, the terms 
piu di meno and men di meno are contractions of pitt radice di meno | and 
men radice di meno 1; that is, “plus the root of minus 1” and “minus the root 
of minus 1.” In our symbols, pid di meno means + /—1 and men di meno 
means — ./—I. Later, he further abbreviated pia di meno to p. di m. and 
men di meno to m. di m. Thus, Bombelli accepted and introduced imaginary 
numbers in this passage and created a notation for them. Translating his into 
ours, if a is areal number, p. di m.a means + ai and m. di m.a means — ai. 

He devoted the rest of Book I to developing the arithmetic of complex 
numbers, starting with the most elementary rules of multiplication for “real” 
and “imaginary” numbers, which he called /a regola del piti et meno. If we 
agree that via means “times” and fa means “makes” or “equals,” it is simpler 
to quote them in Italian with a symbolic representation on the right [pp. 133— 
134]. 


Pit via piu di meno, fa piu di meno (+i) = +i 
Meno via piu di meno, fa meno di meno —(4+i) =-i 
Pit via meno di meno, fa meno di meno +(-1) = -i 
Meno via meno di meno, fa piu di meno —(-i) = +i 
Piu di meno via pitt di meno, fa meno (+1)(47) = -1 
Piu di meno via men di meno, fa pit (+i)(—i) = +1 
Meno di meno via piti di meno, fa piu (—i)(4+i) = +1 
Meno di meno via men di meno, fa meno (—i)(-i) =-1 


Next he gave a definitio that includes that of what we now call the con- 
jugate of a complex number [p. 134]: 


Notice that when we say the Residue of a Binomial [what Cardano called 
Apotome], what is called pia di meno in the Binomial, will be called meno di 
meno in the Residue. 


Thus, if the Binomial is a complex number, then its Residue is what we now 
call its conjugate. 
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Bombelli then went on to explain, by a wealth of examples rather than by 
stating a rule, how to multiply complex numbers. Omitting, as trivial, how 
to multiply a complex number by a real number, we give one of his many 
examples of complex multiplication. We have changed the original notation 
as follows: we write </ instead of R.c. and +./—_ instead of pia di meno R.q. 
Except for these changes in notation, his example is as follows [p. 135]: 


Multiply Vv 3+./—-5 by a 6 + /—20, to do it start similarly [referring to 
the previous, simpler example] by multiplying +./—5 by + /—20, which 
will be — 10, then we multiply 3 times 6 which makes 18, which together 
with — 10 makes + 8, and then we multiply 3 times + —20, makes 
+ ./—180, and then we multiply 6 times + /—5, makes + /—180, which 
together with + ./—180 makes + ./—720, and this together with + 8 and 
with the removed cube root makes ¥/8 + /—720 which is the result of the 
multiplication. 12 


It is clear that the cube roots are spurious here. Other than this cube root 
fixatio of Bombelli’s, this is a pure example of complex multiplication. After 
many more examples [pp. 135-140], only the dimmest of Bombelli’s readers 
could fail to get the hang of complex multiplication. Addition is, of course, 
also used in the preceding computations in the obvious way: add the “real” 
and “imaginary” parts separately. 

Perhaps surprisingly, the next operation that Bombelli considered is the 
extraction of cube roots. Ina chapter entitled Modo di trovare il lato Cubico di 
simil qualita di Radici (A way to fin the Cube side ofa similar kind of Root), 
he gave a general rule (to be illustrated by examples, surely, but a general rule) 
to extract the cube root of a composite number that has a pitt di meno part. To 
be precise, he gave a general procedure to fin the cube root of an expression 
of the form a + ./—b. Referring to a as “the number” and to Vb as “the square 
root,” his method, entirely in narrative form, is as follows [pp. 140-141]: 


Add the square of the number to the square of the root and of this sum extract 
the Cube root, then search by trial-and-error to fin a number and a square 
root such that when their squares are added together they amount to as much 
as the cube root mentioned above and such that when we subtract from the 
cube of the number three times the product of the number times the square 
of the square root, what remains is the number of the cube root that we seek, 


12 Asan example of Bombelli’s equation writing, the last equation in this quotation appears 
in the original as R.c. |8 p. di m. R.q. 720]. 
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and then proceeded to give his firs example without even writing a period. But 
before we present it we shall interpret this statement in current terminology. 


Basically, we set 
va +V¥-b=x+/-y 


and then fin the unknown x and y by trial and error but subject to the following 
constraints: 


wrty= Vaz+b and x? —3xy =a. 


Bombelli did not give any reasons for doing this. His readers would have been 

more interested in the fact that the rule works in example after example than 

in how he found it. But we know today that the firs constraint results from 

equating the absolute values of both sides of the previous equation, and the 

second constraint from equating the real parts of the cubes of both sides. 
Bombelli’s firs example was to fin 


¥2+ /=121, 
in which case the constraints become 
ety= f/224121=5 and x? — 3xy =2. 
Then he started his trial-and-error method as follows [p 141]: 


Now it is necessary to fin a number [x] whose square is smaller than 5 and 
whose cube is larger than 2, which if we assume that it is | the square root 
[./y ] will by necessity be the square root of 4, which squares added together 
make 5 and the cube of the number is | and the product of this number with 
the square [y] of the square root [,/y] makes 4 which tripled makes 12, 
which cannot be subtracted from the cube of the number that is only 1, 
therefore | is no good [as the value of x], nor can 3 be good because its square 
surpasses 5,... 


The next logical choice was x = 2, and then both constraints are satisfie by 
choosing y = 1. Therefore, the desired cube root is 2+ /—1. 

Bombelli was working only with integers in his firs and second examples, 
but his third was a little more adventurous. He proposed to fin the cube root 


of [p. 142] 
8+ ,/-232 8. 
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in which case the stated constraints become 


x +y= f84+2328=62 and x? —3xy=8. 


The trial-and-error method must yield a number x whose square is smaller 
than 6 5 and whose cube is larger than 8. Then he said that x = 2 “is no 
good” because its cube is not larger than 8 and that x = 3 “is equally no good” 
because its square is not smaller than 6 x. Bombelli continued: 


therefore it is necessary to fin a quantity that is larger than 2 and smaller 
than 3, that /2 + 1 has this property, that its square, which is 3 + 8 is 
smaller than 6 z, and its cube is 50 + 7, which is larger than 8. Now let us 
see whether it satisfie the rest: square /2 + 1 makes 3 + 8 and subtract it 
from 6 :, it remains 3 z — ,/8 and this must be 


the value of y. Routine computation shows that this choice of x and y satisfie 
the second constraint. "9 

The next chapter deals with division, and includes the following example 
(written as a quotient in today’s manner) [p. 145]: 


10 


V2eO/—1)ll 


Bombelli’s solution consists in cubing both “parts,” numerator and denomi- 
nator. Then 


we multiply the denominator by 2— (./—1)11, its residue, makes 125, which 
divided into 1000 it becomes 8 and this is multiplied by 2 — (./—1)11 makes 
16 — (/—1) 88, 


13 It can be argued that Bombelli did not guess that x = /2+ 1, that he must have posited 
this value and that of y and then obtained the original number from them. But this does not 
invalidate his trial-and-error method if a sufficient! good approximation is wanted instead 
of an exact root. To show this using decimal notation, not yet available in Bombelli’s time, 
start by trying x = 2.5. Then x? = 6.25, y = 0.416, and x? — 3xy = 12.5 > 8. Since 
the cube is dominant in the last equation, this suggests that 2.5 is too large a value for x, 
and we try x = 2.4 next. This gives x? = 5.76, y = 0.906, and x3 — 3xy = 7.296 < 8, 
suggesting that 2.4 is too small a value for x; but it seems to be closer to the true value 
than 2.5. Three additional attempts will produce the required approximation: x = 2.42 
yields x? — 3xy = 8.289952 > 8, x = 2.414 yields x3 — 3xy = 7.989335776 < 8, and 
x = 2.4142 yields x7 — 3xy = 7.999322685 © 8. Thus, x © 2.4142  /2 +1. 
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and restoring the cube root, we have Fe 16 — (./—1) 88. Disregarding once 
again the intrusive cube root, this is a perfect example of the procedure: to 
divide complex numbers, multiply both numerator and denominator by the 
conjugate of the denominator to make the operation trivial. 

The next four short chapters [pp. 147-150] are devoted to addition and 
subtraction, and contain the unavoidable sprinkling of cube roots. It is not 
necessary to give examples, since these operations have already been used in 
the preceding ones. The remaining four chapters of Book I [pp. 151-154] do 
not involve complex numbers. 

Book I of L’Algebra is devoted to the study of algebraic polynomials and 
to the solution of equations of the firs to the fourth degree. It is here that, 
as we have already seen, Bombelli named the variable i/ tanto and that he 
introduced a half-circle '* surmounted by an integer for the power to which 
it is raised. A remarkable contribution by Bombelli is the complete treatment 
of all possible 42 cases of fourth-degree equations, with which he concluded 
Book II [pp. 268-314]. In this aim at complete generality, he can be considered 
a precursor of Viéete. 

In this second book, Bombelli freely used complex numbers to solve equa- 
tions, starting with second-order equations. In the Capitolo di potenze e nu- 
mero equali a Tanti (Of the square and the number equal to the Unknowns), 
he considered the equation x” + 20 = 8x, solving it by stating that (the reader 
should keep in mind the formula for solving a quadratic equation) [p. 201] 


the square of half of the Unknowns is 16, which is smaller than 20 and this 
equation cannot hold except in this sophistic manner. Subtract 20 from 16 it 
remains —4, its root is + (./—1) 2, and this is subtracted and added to one- 
half of the Unknowns, which will be 4+ (./—1) 2 and the other 4— (./—1) 2, 
and any of these quantities by itself will be the value of the Unknown. 


But Bombelli’s greatest achievement was the solution of the cubic in the 
irreducible case.'5 Once in possession of his method of cube root extraction, 
he could easily solve the cubic x? = 15x +4. He gave the details of Cardano’s 
rule, but no longer had to explain how to fin the cube roots, since that had 
been done in Book I. The following translation is not from the manuscript 
page shown above but from the printed edition, which was intended to be the 
fina version [p. 225]. 


14 For typographical reasons, it was rendered as ~ in the printed version of the book. 
15 The study of all kinds of cubic equations is contained in pages 214 to 268 of Book II. 
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Equate x? to 15x + 4; take one-third of the Unknowns, which is 5, cubed it 
makes 125 and this is subtracted from the square of half the number, which 
is 4, it remains — 121 (which will be called pia di meno) which taking the 
square root of this it will be (./—1)11, which added to half the number makes 
2 + (./—1)11, of which having taken the cube root and added to its residue 
makes 2 + (./—1)1 and 2 — (./—1)1, which joined together make 4, and 4 
is the value of the Unknown. 


Bombelli admitted that this solution may seem extravagant to many, and 
that he himself was of the same opinion at one time, but that he had a demon- 
stration of the existence of a real root. In fact, on page 298 of the 1572 edition 
he gave the following geometric proof on a plane surface [pp. 228-229]. 

In Bombelli’s time, letters were not used as coefficients and he used the 
cubic x? = 6x + 4 in his proof, but we shall reproduce his argument very 
closely using x = px +4q instead. He chose firs a segment q (not the q in 
the equation), shown in the next figure of length one, for which we (and he) 


h } 


will have no further use. Then draw the half-line me (he denoted it by .m.e., 
using dots around the letters in all his segment notations). On it, choose / 
such that m/ = 1, and then f such that /f = p (Bombelli used p to denote 
one of the square rulers shown in the figure) Over this /f draw a rectangle 
abf whose area is g. Now extend ab to a half-line ad and al to a half-line ar. 
Then take two square rulers (just think of them as right angles), and place the 
firs so that its angle is at a point i on the half line ar and one of its arms goes 
through m. Lower or raise it in such a way that if the segment if is extended 
until it touches bd at c, and if the second square ruler is placed with its angle 
at c and one arm on da, then the intersection of the remaining arms at g is on 
me, 


and this done I say that the segment from the point ./. to the angle of the 
[first square ruler is the value of the Unknown and prove it in this manner. 
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Using fractions instead of narrative, the similarity of the triangles i/m and gli 
yields 


and denoting i/ by x and with m/ = 1, we obtain g/ = x”. Then the area of the 
rectangle ilg will be x° and the area of the rectangle i/f will be px because 
il = x and/f = p. The rectangle fg has the same area as the rectangle 
alf, which is q (because—but Bombelli did not explain this, thinking it was 
obvious—the triangle iac has the same area as the other right triangle with 
hypotenuse ic, the triangles i/f and ihf also have equal areas, and so do the 
triangles fbc and fgc). Thus, the area of the rectangle ilg, which is x°, is 
the sum of the area of the rectangle i/f, which is px, and the area of the 
rectangle hfg, which is g. We conclude that x* = px + q and that i/ = x is 
the solution of the proposed cubic. 

Bombelli found only one real solution in his examples of the form x7 = 
px +4q, but some of them have three real solutions. The remaining two are 
easily found by factoring. For example, knowing the solution x = 4 of x? = 
15x+4, we fin the other two by writing x9 —15x-4 = «#-—4 (x? +404), 
which Bombelli could do just as easily in his own notation, and then solving 
the quadratic x? + 4x + 1 = 0 to obtain the remaining roots: —2 + J/3. 

The cubic was a hot topic, as has already been said, and the fact that a 
real solution can be obtained from numbers with an imaginary part is what 
motivated the mathematical community, albeit rather reluctantly, to start using 
complex numbers. 

However, a different solution of the cubic, which does not use square roots 
of negative numbers, was provided in 1591 by Francois Viete on the basis of 
his formula for cos 30. It is in his firs treatise in De e@qvationvm recognitione 
et emendatione tractatvs dvo,'® as an alternative Theorem III. We reproduce 
the part of Viéte’s statement that is of interest to us [p. 174]: 


ANOTHER, 
THIRD THEOREM. 


If A cubed — B squared 3 in A is equal to B squared in D, while B is greater 
than half of D {that is, if AX — 3B*A = B?D and D/2 < B)... 


16 Page references are to the English translation by Witmer in The analytic art by Francois 
Viete, 1983. 
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And there are two triangles of equal hypotenuse B, such that the acute 
angle subtended by the perpendicular of the first is triple the angle subtended 
by the perpendicular of the second; and twice the base of the first is D, and A 
is twice the base of the second. 


Admittedly, Viéte seems to be talking in riddles. But this is only a tempo- 
rary impression, since this result bears interpretation. In the given cubic A is 
the unknown, so we can call it x. Then, if we write p = 3B? and q = B*D, 
the cubic becomes x? = px + q. Next, the hypothesis D/2 < B becomes 


q 3q ‘2 
=> < ; 
2B 2p 3 


which, squaring and simplifying, reduces to 


q 14 


ea en 
4 27 
In short, we are dealing with the irreducible depressed cubic. 

Then, to obtain the apparently cryptic solution, start by dividing the original 
cubic, A? — 3B7A = B’D, by 2B°. The result can be written as 


AS A D 
4 ) 2 ne 
2B 2B 2B 


But this brings to mind the equation 


4cos* 6 — 3cos@ = cos 36, 


which is just a rearrangement of cos 39 = cos* @—3 cos 6@ sin’ 6 after replacing 
sin? 6 with 1—cos* 6. This was easy for Viéte, who had already discovered this 
identity, as we saw in Chapter 1. Now acomparison of the stated trigonometric 
identity and the cubic equation of the theorem becomes inescapable. The two 
equations are one and the same if 


0 e d 30 2 
COST a an C0830 = 5 
This is what Viéte expressed by means of his two triangles, which will be of 
more help if they are actually drawn, as shown below. 
The solution of the cubic is now very clear. The condition D/2 < B 
means that D/2B is a valid cosine, and then 30 can be obtained from a table 
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D/2 A/2 


of cosines. Once @ is known, the solution is A = 2B cos @. Or, if we prefer to 
write the cubic as x7 = px + q and if we note that 
D q q/2 


2B 2B (p/3)3/? 


then the solution is 


q/2 


[p 1 
x = 2,/—cos (; arccos ) 
3 3 fp? /27 


The last quotient is a valid cosine because g*/4 < p?/27, that is, precisely 
because the cubic is irreducible. 

Using an electronic device rather than a table of cosines, we can re-obtain 
Bombelli’s solution of x? = 15x + 4 as follows: 


2 79.695 15353° 
= 2/5 cos ( —————"_ ] = 4 
a) ( 3 ) 


Note that the angles 360° + 79.69515353° have the same cosine, and can be 
used in place of 79.69515353° in the stated formula to obtain the remaining 
real roots: —3.732050808 = —2 — /3 and —0.267949192 = —2+ /3. 

It is fortunate that Viéte’s solution without imaginaries did not appear until 
1615. Had this result been published in the sixteenth century, it might have 
delayed any interest and investigation into the nature of complex numbers. 

Bombelli’s notation, p. di m. and m. di m., was not subsequently adopted. 
The notation that would survive, ./ —a, was introduced by Albert Girard 
(1595-1632), while giving the solutions of the quartic x4 = 4x — 3 as les 
quatre solutions seront 1, 1, —1 + ./—2, —1 — /—2."" This notation was 
adopted by many mathematicians, including Newton, Wallis, and Euler. As 


1 
x = 2/5 cos (; arccos 


17 Invention nouvelle en l’algébre (The new invention in algebra), 1629, ff. FA and FB. 
See Cajori, A history of mathematical notations, II, 1929, p. 127. 
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for the word, “imaginary” that we use to this day, it was originally introduced 
by René Descartes (1596-1650) in La Géomeétrie of 1637 as follows [p. 380]: 


Neither the true nor the false [negative] roots are always real; but sometimes 
only imaginary. 1S 


For a long time this would be the word to describe what we now call complex 
numbers. Acceptance was one thing but understanding was slower in coming, 
and mathematicians were puzzled by the meaning of imaginaries. Newton 
referred to imaginary roots as radices impossibiles,'9 and Leibniz, in a more 
literary mood, expressed himself as follows: 


Therefore, an elegant and admirable way out is found in that miracle of 
Analysis, that portent of the ideal world, almost Amphibian between Being 
and non-Being, that we call an imaginary root.29 
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More than a century and a half passed since the publication of Bombelli’s 
l’Algebra until a method to compute the nth roots of complex numbers was 
made available. It was developed by Abraham de Moivre (1667-1754), a 
French mathematician who resided in England for religious reasons since the 
revocation of the Edict of Nantes protecting Huguenots. 

It appeared in a paper of 1739,2' in which he considered four problems. 
The firs does not refer to complex numbers, the second deals with the extrac- 
tion of cube roots, and the third was “To extract the Root, whose Index is n, 
of the impossible Binomial a + /—b.” Note that n must be a positive integer 
and that b > 0 if the “impossible binomial” is what we now call a complex 


18“Ay reste tant les vrayes racines que les fausses ne sont pas tousiours reelles; mais 
quelquefois seulement imaginaires.” In The geometry of René Descartes, translated by 
Smith and Latham, 1954, pp. 174-175. Also in Hawking, ed., God created the integers. 
The mathematical breakthroughs that changed history, 2005, p. 345. 

19 Arithmetica universalis, 1707, p. 242. 

20 «Ttaque elegans et mirabile effugium reperit in illo Analyseos miraculo, idealis mundi 
monstro, pene inter Ens et non-Ens Amphibio, quod radicem imaginariam apellamus.” In 
“Specimen novum analyseos pro scientia infiniti circa summas & quadraturas,” 1702 = 
Opera mathematica, p. 378 = Gerhardt, Leibnizens Mathematische Schriften, Sec. 2, 1, 
No. XXIV, p. 357. 

21 “De reductione radicalium ad simpliciores terminos, seu de extrahenda radice qua- 
qunque data ex binomio a + +5, vel a + /—b, Epistola,” 1739. 
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ABRAHAM DE MOIVRE IN 1736 
Portrait by Joseph Highmore, engraving by John Faber. 
From Florence Nightingale David, Games, Gods and Gambling. 
Charles Griffin London and Hafner Publishing Co., New York, 1962. 


number. Then he stated his solution, which needs more than interpretation, as 
follows [p. 475]: 


Let that Root [the nth — ofa+/—b |bex+./—y; thenmaking Yaa +b = 


m; and also making ” —— = p, describe, or imagine describing, a circle, 
n 


nt 8 F : . . a 
whose Radius is ./m, in which take some arc A, whose Cosine is — let 
m 


C be the whole circumference. Take to the same Radius, the Cosines of the 
Arcs 


A C-A C+A 2C—A 2C+A 3C-A 3C+A 


’ 
n n n n n n n 


> &. 


Until the number of them [the Cosines] equals the number n; this done, 
thereupon; these Cosines will be so many values of the quantity x; as for the 
quantity y, it will always be m — xx. 
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Let us admit at the outset that this is just a statement without proof, possibly 
because de Moivre had just presented a fully detailed discussion of the partic- 
ular case n = 3 as his second problem, and felt entitled to omit the details in 
the general case. 

In any event, the equation definin p seems to contain a typo. If, instead, 
we defin p = (n — 1)/2, we can interpret the result in present-day notation. 
If we write z =a +i/b and recall the definitio of m, then 


lz] = Va? +b= Vm" 

and 

a a 

arg z = arccos — = arccos ——- 

|z| f/m" 
For de Moivre, as for many of his predecessors and contemporaries, the Cosine 
of an arc A was what we now denote by R cos A, where R is the radius of the 
circle, in this case ,/m. Then, his choice of A is such that 


/m cos A = E 
mP 


and it follows that 


a a 
om = arecos — iy = arg. 
This clears up the nature of the arc A, and by C de Moivre must have meant 
2x, not the whole circumference of the circle with radius ./m. 

There is a shortcoming and a fl w in de Moivre’s solution. Only complex 
numbers in the upper half-plane are of the form a + ./—b if b > 0, so that the 
roots of a—./—b must be found by a similar method. This is the shortcoming, 
and the fl w is to assume that all the roots of a number in the upper half-plane 
are of the form x + ./—y with y > 0. Actually, some of the roots are of the 
form x — ,/—y, as shown by the simple case of a+ ./—b = 0+ /—I. In this 
case, A = Sm, and n = 3, and the three arcs in de Moivre’s method are 


A = arccos 


5 Qn —tn 3 In +'n 5 
rae 2 — ay and eae eo ep 


3 6 3 6 3 6 


The cosine of the second arc is zero, which yields x = 0 and y = 1— x? = 1. 
This gives the complex number 0+ ./—1, which is not a cube root of 0+ ./—1. 
Instead, the desired cube root is 0 — ./—I (in our terms, —i is a cube root 
of i). De Moivre could have easily avoided these problems with a change of 
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notation; for instance, using a + b,/—1 instead of a + ./—b and allowing b 
to be negative. 

The preceding explanation notwithstanding, it is a fact that de Moivre never 
stated the nth roots of a complex number in a fully explicit form. Euler got 
closer when ten years later, in 1749, he submitted a paper in which he posed 
a more ambitious problem than that of de Moivre:24 


PROBLEM | 


79. An imaginary quantity [meaning complex] being raised to a power whose 
exponent is any real quantity, find the imaginary value that results. 


His method of solution is based on a formula that he had proved in his 
Introductio,?3 so we present this first Once he was done with series expan- 
sions for exponentials and logarithms in Chapter VII, he turned his attention to 
“transcendental quantities derived from the circle” in the next chapter. In Arti- 
cles 127 to 131, Euler introduced the notation that he would use—essentially 
the modern notation—and then gave a number of trigonometric identities. 
Next he evaluated the product [Art. 132] 


(cos.y +./— 1. sin. y)(cos.z + ./— 1. sin. z) 
= cos. ycos.z — sin. ysin.z + ./— L.(sin. ycos.z + cos. y sin. z) 
= cos.(y +z) + /— L sin. (y + 2), 


which we have reproduced keeping his notation but abbreviating his writing, 
and similarly, 


(cos. y—/— 1. sin. y)(cos.z—/— l.sin.z) = cos. (y-+z) —/— 1. sin. (y+z). 
These give [Art. 133] 

(cos.z+ ga l.sin.z)? = cos.2z+ fee 1. sin.2z, 
and, in general, the formula 


(cos.z + /— L.sin.z)” = cos.nz + /— l.sin.nz 


22 “Recherches sur les racines imaginaires des équations,” 1749, pub. 1751. References 
will be by article number so that any of the references listed in the bibliography can be used. 
The stated problem and its solution are in Art. 79. In Art. 93 he considered the general 
problem of raising a complex number to a complex power. 

23 Introductio in analysin infinitorum, 1748. We shall refer to the Introductio by article 
number rather than by page, so that the reader can refer to any available edition. 
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for any positive integer n. 

If we are permitted a digression that will tie some loose ends in our ear- 
lier discussion of trigonometry, Euler had stated the following in the second 
paragraph of Chapter VIII [Art. 126]: 


Let us therefore stipulate that the Radius of the Circle or total Sine be = 1, 
. let 2 = the Semicircumference of the Circle whose Radius = 1, or 
[equivalently] z will be the length of the Arc of 180 degrees.”4 


It was with these words that Euler’s imposing mathematical authority sealed 
the value of the trigonometric sinus totus to be 1 from that point on.2° How- 
ever, he was not being original in this selection, for it had already been made 
by Abu’! Wafa and Joost Biirgi, but it is almost certain that Euler was not aware 
of this. With this choice of the radius equal to 1, the trigonometric lengths 
effectively became trigonometric ratios, and soon, in the rest of Euler’s work, 
trigonometric functions. In this manner, trigonometry disappeared as an in- 
dependent branch of mathematics to become, as in the rest of this section, a 
part of mathematical analysis. 

Before moving ahead, several comments are in order about the last equa- 
tion. First, it should be pointed out that Euler was the firs to consistently 
write ./— 1 for the imaginary unit, and that later he introduced the current 
symbol i for it.2° It will be convenient to use this new symbol from now on 
even if Euler himself did not until much later, and, since the periods in the 
abbreviations for sine and cosine are no longer in use, we shall omit them 
from now on and rewrite the previous equation as 


(cos z tisinz)” =cosnz +i sinnz. 


Finally, it turns out that this formula was known to de Moivre, although he 
never wrote it explicitly like this. The closest he got is as follows (to be 
explained below):2” 


24 “Ponamus ergo Radium Circuli seu Sinum totum esse = 1, ... ita ut sit 7 = Semi- 
circumferentiz Circuli, cujus Radius = 1, seu z erit longitudo Arcus 180 graduum.” 

25 4 complete translation of Articles 126, 127, 132 and 133 is included in Calinger, 
Classics of Mathematics, pp. 494—495. 

26 With the words “formulam \/ — 1 littera i in posterum designabo” (in what follows I 
denote the formula ,/—1 by the letter 7) in “De formulis differentialibus angularibus maxime 
irrationalibus, quas tamen per logarithmos et arcus circulares integrare licet,” presented to 
the Academy of Saint Petersburg on May 5, 1777, but firs published in his book Jnstitvtionvm 
calcvli integralis, 2nd. ed., IV, 1794, p. 184. 

27 “De sectione anguli,” 1722, p. 229. 
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Let x be the Versed Sine of any Arc whatever. 
t the Versed Sine of another Arc. 
1 the Radius of the Circle. 
And let the former Arc be to the latter as 1 ton, Then, assume two Equations 
which may be regarded as known, 


t= 22" + zon —22"t 
1—2z7+2z = —2zx. 


Eliminate z, there will arise an Equation by which the Relation between 
x & t will be determined. 


Regardless of the possible meaning of z (it is not essential for our purposes), 
recall that the versed sine of g is 1 — cosy. Then let x = 1 — cosg and 
t = | —cosng. Solving the quadratic equations 


1— 227 +2" =—22"(1—cosng) and 1—2z4+ 27 = —2z(1 —cosg) 


for z” and z gives 


z" =cosng+,/cos*ng—1 and z=cosy+,/cos?g—1. 


It follows that 
(cosy tising)" =cosng +i sinng. 


Because it was implicitly contained in de Moivre’s work, this equation has 
been known as de Moivre’s formula since the nineteenth century. 

But of course, Euler went further to generalize the validity of this equation. 
After restating the fundamental equation as 


(cosy +ising)” = cosmy +isinmge 
in his 1749 paper [Art. 85], he added: 


But that the same formula is valid also, when m is any [real] number, differ- 
entiation after having taken logarithms will show it beyond a doubt. 


What this shows is that both sides of de Moivre’s formula have the same 
derivative with respect to gy, and since they are equal for y = 0 they must be 
equal for all g. Thus, de Moivre’s formula is valid for any real exponent, and, 
what is more, Euler used it. 
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He had begun the solution of Problem 1 in Art. 79 by writing (a + bi)” = 
M + Ni, and his goal was to fin M and N. First he put 


J (aa+ bb) =c, 


and then he stated: “let us look for an angle g such that its sine is = b/c and 
the cosine = a/c,” which if found would allow us to write 


a+ bi =c(cosy +isin@). 


Here g can be replaced by y + 2kz, k any integer (Euler did not write the k, he 
Just spelled out a few of these angles and then wrote “etc.”’), because their sines 
and cosines “are the same” as for gy. Then, using de Moivre’s formula (whose 
general proof he postponed until Art. 85), the proposed power becomes 


(a+ bi)” =c"(cosy +ising)” = c”"(cosmy +isinm®@). 
From this he concluded that 
M =c"" cosm@ and N=c"sinmg. 


After findin this solution, Euler stated fi e corollaries. The firs two are 
useless today, and the third stated that if m is an integer then (a+ bi)” has only 
one value because all the angles involved (namely, y + 2k7) have the same 
sine and cosine [Art. 82]. The fourth [Art. 83] is of interest to us, asserting 
that ifm is of the form jz/v, where y and v are positive integers, then (a+ bi)” 
will have exactly v values, which essentially solves the problem of findin the 
roots of complex numbers. If m = 1/n, where n is a positive integer, and if 
we combine all the possible values of M and N in a single formula (which 
Euler did not do), we have 


Vath = te (co [ oa peg ) 
n 

Equivalently, this formula can be written using y — 2kz instead of g + 2kz. 
In either case, the only distinct values are fork = 0,1,...,n —1. Finally, the 
fift corollary [Art. 84] states that if m is irrational, (a + bi)” “will have an 
infinit number of different values.” 

Now we can return to Cardano’s formula, but choosing the cubic from 
Bombelli’s proof for the Cubo equale a Tanti e numero, namely x* = 6x +4. 
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This is easier for hand calculation than x7 = 15x + 4, and when solved by 
Euler via Cardano’s rule yielded the solution 2° 


ga aT 4/27. 


At this point, Euler added: “which cannot be expressed otherwise.” This is 
surprising because it is easy to express it “otherwise” using the formula just 
derived to fin complex roots. Ifa+bi = 2+2i, itis clear thatc = /4+4 = 
J/8 and cosy = sing = 2//8 = 1/V2, so that y = 2/4. Then 


ee oe 78 = V2, 
and the three cube roots of a + bi are 
V3 (cos + isin =) 
Vi (cos 7° == isin “2 =) — v3 ( “ i SI =), 
and 
Vi (cos "2 = + isin 7°) = Vi (cos a +iSi in ) 


Similarly, but using gy = —2/4, y — 27, and g — 47, the three cube roots of 


2 —2i are 
4 1 
—vVJ/2 i Si ; 
vi( 12 =) 


and 


17 17 17 17 
oe (- 1) oom) -vaon —e 


Hence, the three solutions provided by Cardano’s formula are 


els uae me ; ca ae _ 
Vi (cos 5 + isin =) + V3 (cos 7 isin =)= a2 


28 Volistiindige Anleitung zur Algebra, 1770, Second Part, First Section, Chapter 12, Ar- 
ticle 188. 
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3x 3 3 

Vi (cos =F + isin) + V3 (cos — isin} = W200 = ~2, 
4 4 4 4 

and 


17x 17 
V2 (cos 


TU 17x 
in —— i sin —— } = 22 cos —_- 
3 +i sin | + V3 (cos i isin ) /2 cos 2 
If we now use the formula cos(a + 8) = cosa cos 6 + sina sin B, we obtain 
cos 7 = 0s (3 +) = LWP 4 V3N2 = EW? 5 
12 — 3 4 22 2 2 
and 


oo) -8C9)- SE) 


and then the three solutions of x> = 6x +4 become 1 + /3, —2, and 1 — V3. 
All three are real, and this represents a triumph for imaginary numbers. From 
this moment on they could not be denied. 


3.5 THE LOGARITHMS OF COMPLEX NUMBERS 


To lay the ground for obtaining the logarithms of complex numbers, we re- 
turn to the Introductio. In Article 134 Euler deduced the following series 
expansions for the sine and the cosine, using the letter v for the variable: 


v? v4 v® 


= + &e., & 
ee an er wee er mead 
vp v v! 
inv = &c.29 
Se ee (sods Aen o 


A few paragraphs later, Euler gave formulas for cos v and sin v in terms of 
exponentials, and from these he deduced his now famous formula [Art. 138] 


e'” =cosu+isinv, °° 


29 This was unnecessary, since these series were obtained by Newton in 1665 and firs 
made public in 1669 in De analysi per eequationes numero terminorum infinitas. The sine 
and cosine series are on page 17 of Jones’ edition and on pages 236 and 237 of Whiteside’s 
1968 edition. They appeared in De analysi for the firs time in Europe, although they were 
known in India since the fourteenth century (see Section 4.3). 


30 Since it is famous, here it is in Euler’s original notation: et?Y-! = cos.v + /-1. sin.v. 
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which is extremely useful in many practical applications. It is universally 
known today as Euler’s formula or Euler’s identity. In proving this result 
Euler was original because he was not aware of an equivalent result that was 
obscurely embedded in the one and only paper published by the English math- 
ematician Roger Cotes (1682—1716),°" the man of whom Newton said: “If He 
[Cotes] had lived we might have known Something.” °? 

The argument that Euler used to deduce this formula in the /ntroductio is 
rather dirty. Perhaps no less dirty but definitel briefer is a third proof (his 
second requires knowledge of integral calculus) that he gave in 1749.9° Here 
he put an imaginary number ig in place of the real variable x in the series 
previously obtained for the exponential function, taking it as an act of faith 
that [p. 166; 219] 


(ig (ivy (ig)? (ig)? 

ig =] + ete. 

7 eT] OV age ge tededaes 
=1l+i e i e | ¢ i e etc 
See Ta ies” Wilnsed eles Aes 


Actually, Euler did not write e’” as we have done. Rather, in the vein of what 


317 ogometria,” 1714. The words “obscurely embedded” are explained by the form in 
which the formula appears on page 32, as a small statement in the midst of a number of 
geometric constructions: “Nam si quadrantis circuli quilibet arcus, radio CE descriptus, 
sinun habeat CX sinumque complementi ad quadrantem X E; sumendo radium CE pro 
Modulo, arcus erit rationis inter EX + XC./—1 & CE mensura ducta in /—1.” This 
can be translated as follows: “For instance if any arc of a quadrant of a circle, described 
with radius C E [with the center at E], has sinus CX and sinus of the complement to the 
quadrant [cosinus] X £; taking the radius C E as modulus, the arc will be the measure of the 
ratio between EX + XCJ/—1&CE multiplied by /—1.” For Cotes, if A, B, and M are 
positive numbers, “the measure of the ratio A over B modulo M” meant M/1(A/B), where / 
denotes the natural logarithm. Thus, if R = CE and if the arc is denoted by RQ, the ratio 
(EX + XC./—1)/CE equals cos @ + i sin0, and Cotes’ statement becomes i RI (cos @ + 
isin?) = RO. Except for the obvious sign error, this is Euler’s formula. This paper 
is combined with other previously unpublished works in Cotes’ posthumous publication 
Harmonia mensurarum, 1722, pp. 4—41. The stated quotation is on page 28 of Harmonia 
mensurarum and page 170 of Gowing’s translation. 

32 According to a handwritten note by Robert Smith, editor of Harmonia mensurarum, on 
his own copy of this book. 

33 “De la controverse entre Mrs. Leibnitz et Bernoulli sur les logarithmes négatifs et 
imaginaires,” 1749, pub. 1751. Page references below are to the original paper firs and, 
after a semicolon, to the Opera omnia. 
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he had written in Art. 115 of the Introductio (see page 139), he wrote 


(1+). 
n 


where n is infinitel large. Putting this little detail aside, not only did he dare 
to replace the variable in the series for the exponential with an imaginary num- 
ber, but he rearranged the infinitel many resulting terms at will. Implicitly 
gathering all the real terms together and then all the imaginary ones, and in 
view of the infinit series previously obtained for the sine and the cosine, he 
obtained 

e'? = cosy + ising. 


We should say in passing that, besides its usefulness, Euler’s formula has an 
interesting consequence that has always fascinated mathematicians. Putting 
gy = 7, we obtain e'* = —1, which is usually rewritten as 


7 4+1=0. 


This is said to be the most beautiful formula of mathematics, relating the 
fi e most important numbers, 0 and 1 from arithmetic, z from geometry, i 
from algebra, and e from analysis, by three of the most important operations: 
addition, multiplication, and exponentiation 

As for the logarithms of complex numbers, Euler already knew that there 
was aninfinitud of them for any number, as mentioned at the end of Chapter 2, 
and now posed the problem as follows [p. 165; 218]: 


PROBLEM III 
Determine all the logarithms of any imaginary [complex] quantity. 


His solution consisted in taking a nonzero complex number a + bi, letting 
J (aa + bb) = c, and choosing an angle y whose sine is b/c and cosine a/c. 
Then, 
a+bi=c(cosg+ising), 


“or, since c is a positive number, let C be its real logarithm, and we will have” 
l(a+ bi) =C+l(cosg+ising). 


At this point we must depart from Euler’s own presentation, which has become 
archaic in that it does not use the exponential function or his famous formula 
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stated above, lacks rigor by using an “‘infinit number,” and is way too long. If 
we use Euler’s formula instead, as written in the /ntroductio and as is written 
today, and if p is any even integer, then 


l(a+ bi) = C+ 1[cos(g + pr) +isin(g + pz)] 
=C+ i[e@tem] 
=C+ (ot pri. 
This last line is the result as stated by Euler [p. 167; 220]. To rewrite it in 


current terminology, put z = a + bi, in which case c = |z|, C = l(c), and 
gy = arg z. Then we have 


Lz) =1(z|) + @ + pr)i, 


where p is any even integer, giving infinitel many values of the logarithm of 
every complex number z. 

If z is real and positive, only one value of its logarithm is real (the one for 
p = 0, since g = 0). But Euler easily concluded that “all the logarithms of an 
imaginary [complex] quantity are also imaginary” [p. 167; 220]. In particular, 


i(-l) =10) + @s pa) = 1+ p)zi, 
or, as Euler wrote it (but without using the parentheses) [pp. 168; 221—222], 
I(—1) = qui, 


where gq is “any odd number.” 

In June 1746 Euler wrote a letter to his friend Christian Goldbach, at the 
very end of which he revealed an even more surprising fin than the logarithms 
of imaginaries:*4 


that the expression (/ — iv has a real value, which in decimal fraction 
= 0,2078795763, which to me appears to be remarkable. 


The problem of evaluating i’ had a long history. The Italian Giulio Carlo 
de’Toschi di Fagnano (1682—1766), Johann Bernoulli, and Euler himself be- 
fore this time had made some preliminary progress. In 1746 he limited himself 


34 Tn Fuss, Correspondence mathématique et physique de quelques célébres géométres 
du XVIIT"* siécle, tome 1, 1843, p. 383; and on p. 182 of the reprint by Johnson Reprint 
Corporation. 
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to state the one value given above, but in 1749 he gave the complete solution. 
It is very simple to obtain it if we put a = 0 and b = 1 in Euler’s formula for 
the logarithm, for this gives C = /(1) = 0 and g = 77/2, and then 


Mi) = (40 + pri. 
Therefore, 
eae —(37 de pr), 
so that 
if = a pr 

“what is all the more remarkable because it is real and includes an infinitud 
of different real values.”°5 

But in spite of Euler’s enormous success with complex numbers, their ac- 
ceptance was slow, very slow. Solving the cubic was no longer fashionable, 
several other trigonometric series had been obtained without the help of com- 
plex numbers and—more importantly—what were these numbers and what 
could their infinitel many logarithms mean? 


3.6 CASPAR WESSEL’S BREAKTHROUGH 


It may be said that complex numbers made their most important inroads 
into mathematical thought through the back door; not through their algebraic 
prowess but through their geometric representation. After all, if they can be 
represented geometrically they must exist. The firs to try his hand at such a 
task was John Wallis (1616-1703), in Chapters LX VI through LX VIII of his 
Algebra of 1685, but his efforts did not amount to much of any use. 


35 The last formula and the quoted statement are from Art. 97 of “Recherches sur les 
racines imaginaires des équations” (except that Euler wrote ./ — 1 instead of i and 2A in 
place of p), but we have not presented his own proof because the method of using logarithms 
directly is much briefer and more relevant at this moment. His own proof begins by setting 
[Art. 93] 

(a ae biy™™ —x+ yi, 
where a, b, m, and n are assumed to be given. Then taking logarithms, findin the total 
differentials of both sides, and separating real and imaginary parts leads to two equations 
in the unknowns x and y. Solving them gives 


x=c"e"? cos(mg + nl(c)) and y=c"e "’ sin(mg + nl(c)), 


where c and @ are as above. The result follows by putting a =m = 0,b =n =c = 1, and 
Q= 50 + pr. 
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On March 10, 1797, Caspar Wessel (1745-1818), a land surveyor (born in 
Vestby, in what today is Norway but was then part of Denmark), presented a pa- 
per to the Royal Danish Academy of Sciences, On the analytic representation 
of direction, which was published two years later, essentially containing the 
idea of representing each complex number by a vector in the plane anchored at 
the origin. On this basis, he then gave laws of addition and multiplication that 
were geometric in nature. The firs is just the familiar rule for vector addition 
in the plane [§ ij 


Two right lines [vectors] are added if we write them in such a way that the 
second line begins where the firs one ends, and then pass a right line from 
the firs to the last point of the united lines; this line is the sum of the united 
lines. 


He thereby initiated the vector calculus. But the multiplication law, which we 
shall explore next, was Wessel’s main contribution [§4]: 


Firstly, the factors shall have such a direction that they both can be placed in 
the same plane with the positive unit. 


Secondly, as regards length, the product shall be to one factor as the other 
factor is to the unit. And, 


Finally, if we give the positive unit, the factors, and the product a common 
origin, the product shall, as regards its direction, lie in the plane of the unit 
and the factors and diverge from the one factor as many degrees, and on the 
same side, as the other factor diverges from the unit, so that the direction 
angle of the product, or its divergence from the positive unit, becomes equal 
to the sum of the direction angles of the factors. 


Since this write-up is beyond our modern endurance for verbosity (or perhaps 
it is a shortage of modern patience with detailed description), we can rephrase 
it as follows: 


e We consider two vectors from the origin coplanar with the one from 
(0,0) to @, 1). 


e The length of their product is define to be the product of their lengths, 
and 


36 The quotations from Wessel’s paper included here are from the translation by Martin A. 
Nordgaard in Smith, A source book in mathematics, pp. 55—66. They are by section number 
rather than by page. For the original paper and other translations, see the bibliography. 
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e The angle of their product (counterclockwise from the positive x-axis) 
is the sum of the angles of the factors. 


So far this looks like the beginning of vector calculus—which it is—but 
then Wessel stated the following [§ 5]: 


Let +1 designate the positive rectilinear unit and +€ a certain other unit per- 
pendicular to the positive unit and having the same origin; then the direction 
angle of +1 will be equal to 0°, that of —1 to 180°, that of +€ to 90°, and that 
of —e to —90° or 270°. By the rule that the direction angle of the product 
shall equal the sum of the angles of the factors, we have: (+1)(+1) = +1; 
GDC) = -1 CDC) = +h GDC.) = +6; G)D(-e) = —¢; 
(-lD(te) = —e; (-l)(-e) = +6; (+6)(te) = -1; Ge)(-e) = 41; 
(—e)(—e€) = -1. 


From this it is seen that € = ./—1 ; and the divergence of the product is deter- 
mined such that not any of the common rules of operation are contravened. 


In this fashion, Wessel introduced what would eventually become known 
as the complex plane, while using the letter « instead of i, and he also proved 
the formula 

(a + €b)(c + €d) = ac — bd + €(ad + bc) 


from his multiplication rule [§ 10] and the usual rule for division [§ 12]: 


at+eb ac + bd + €(bc — ad) 
cted c+ a? , 

We should observe that Wessel’s geometric multiplication rule has the 
advantage over the usual formula (a + bi)(c + di) = ac — bd + (ad + bc)i, 
reverting now to i instead of «, from a computational point of view. For 
instance, which rule would you choose to evaluate (0.6 + 0.5i)*°, even if a 


hand-held calculator were allowed? Using the latter formula nineteen times 
seems rather forbidding, but noting that the length of 0.6 + 0.5i is 


V 0.67 + 0.57 = 0.781024968 


and that its direction angle is 


0.6 


COS 0781024968 


= arccos 0.76822128 = 39.8055711°, 


we have 
0.7810249687° = 0.007133429 
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and 


20 x 39.8055711° = 796.111422°. 
Then 


(0.6 + 0.5i)?° = 0.007133429 (cos 796.111422° + i sin 796.111422°) 
= (.001712269 + 10.006924879. 


This illustrates dramatically the power of Wessel’s multiplication law. 

In the same manner, Wessel discovered that the direction angle correspond- 
ing to the mth root of a complex number is the original direction angle divided 
by m, and gave the already known formula for the mth roots [$15]. In the 
process, he rediscovered de Moivre’s formula once more, even for fractional 
exponents [§ 13]. In his isolation, as a nonmathematician, he must have been 
unaware of these known facts. The reader is invited to use this method to evalu- 
ate the cube roots contained in the solution of Bombelli’s cubic, x? = 15x +4. 

Although his paper was published, in Danish, in the Academy’s memoirs 
two years after Wessel’s oral presentation, it remained largely ignored until it 
was rediscovered and its value recognized in 1895. 

Meanwhile, a similar geometric representation of complex numbers was 
privately printed, but not published, by an amateur mathematician residing 
in Paris, Robert Argand (1768-1822), in a booklet that did not even include 
the author’s name.°” It also remained ignored until, by a happy series of 
circumstances, Argand’s work was rediscovered by Jacques Frangais (1775— 
1883), a professor at Metz, who published a summary in 1813 together with a 
plea for the unknown author to reveal his name.?® Argand answered the plea, 
and his name was printed in the next issue.°9 

eae posed the problem and the difficult in solving it as follows [Art. 3]: 
to fin 


37 Essai sur une maniere de représenter les quantités imaginaires, dans les constructions 
géométriques, privately printed in 1806 (according to Argand). The often repeated partic- 
ulars about Argand’s name, biographical details, and occupation are now questionable. In 
this regard see Schubring, “Argand and the early work on graphical representation: New 
sources and interpretations,” 2001, pp. 130-134. 

38 “Philosophie mathématique. Nouveaux principes de géométrie de position, et in- 
terprétation géométrique des symboles imaginaires,” 1813, p. 71. 

39 “Philosophie mathématique. Essay sur une maniére de représenter les quantités imagi- 
naires dans les constructions géométriques,” 1813, p. 133. 

40 This translation is from Hoiiel’s second edition, 1874, p. 6. There is another translation 
in Hardy, Imaginary quantities: Their geometrical interpretation, 1881, p. 23. 


SECTION 3.6 CASPAR WESSEL’S BREAKTHROUGH _— 189 


the quantity x that satisfie the proportion 


tl:+x::+x:-1 


[that is, +1/x = x/(—1)]. We stop here... because x cannot be made equal 
to any number, either positive or negative ... 


He provided the solution [Art. 4] with the aid of the following figur (from 
which we have eliminated twelve additional radii that are not necessary for our 


E 


N 


purposes), in which the directed distance KA represents +1 and K/ represents 
—1. Then he stated that 


the condition [the proportion mentioned above] to be satisfie will be met 
by KE, perpendicular to the above and with the direction from K to E, 
expressed in like manner by KE. For the direction of KA is to that of KE 
as is the latter to that of KT. Moreover, we see that this same condition is 
equally met by KN, as well as by KE, these two last quantities being related 
to each other as +1 and —1. They are, therefore, what is ordinarily expressed 


by +/-—1, and —./-1. 


In other words, it is possible to turn 1 into —1 if we rotate 1 counterclockwise 
in the plane by 90° and then by another 90°. The same is accomplished if we 
multiply 1 by /—1 and then multiply the result again by /—1. In this way 
we can think of ./—1 as a counterclockwise rotation of 1 by 90°. 

Argand’s ideas were ignored as much as Wessel’s by the mathematical 
community of the times, but once complex numbers were accepted there was 
frequent mention of le diagramme d’Argand in France. 
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Two additional contributions of 1828, one French*’ and one English,44 
have remained largely ignored since their publication even to this day. 


3.7 GAUSS AND HAMILTON HAVE THE FINAL WORD 


Carl Wilhelm Friedrich Gauss (1777-1855) was a more influentia exponent of 
the geometric representation of complex numbers. Ina letter of December 18, 


CARL FRIEDRICH GAUSS IN 1828 
Portrait by Siegfried Detlev Bendixen. 


a Mourey, La vrai Théorie des quantités négatives et des quantités prétendues imagi- 
naires, 1828, 1861. 


42 Warren, A treatise on the geometrical representation of the square roots of negative 
numbers, 1828. 
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1811, to the German astronomer Friedrich Wilhelm Bessel (1784-1846), of 
the K6nigsberg observatory, he stated that 4% 


we can get a feeling for the total Realm of both Quantities, real and imaginary 
Quantities by means of an infinit Plane, wherein each Point, determined by 
an Abscissa = a and an Ordinate = b, represents as it were the Quantity 
a+ bi. 


But he published this view for the firs time in a paper of 1832, stating that*4 


any complex quantity can be represented by some point in an infinit plane, 
by which it is readily referred to real quantities, one may know the complex 
quantity x + iy by the point, whose abscissa = x, ordinate ... = y. 


On both occasions he described a complex number as a point, not as a vector 
as Wessel and Argand had done, and on the same page of the 1832 paper he 
define subtraction and multiplication in the expected way. He had already 
define division algebraically.*® 

It was at this time that Gauss started “by habit denoting by 7 the imaginary 
quantity ./—1,” 46 and that, referring to numbers of the form a+bi, stated “such 
numbers we call complex numbers,” 4” as opposed to imaginary numbers. The 
name of Gauss carried sufficien authority to establish the geometric ideas on 
complex numbers in mathematics, and, since then and with justifiabl pride, 
many German mathematicians have made reference to die Gaussische Ebene. 
In the words of Einar Hille, the “Norwegians with becoming modesty avoid 
claiming det Wesselske planet.” 48 

The geometric approach to complex numbers was rejected by Sir William 
Rowan Hamilton (1805-1865), of Dublin, who felt that algebra must be sepa- 


43 Briefwechsel zwischen Gauss und Bessel, 1880, pp. 156-157 = Werke, 8, 1900, pp. 90— 
91 = Werke, 101, 1917, p. 367. 

44«Theoria residuorum biquadraticorum. Commentatio secunda,” 1832, Art. 38 = 
Werke, 2, 1863, p. 109. 

45<Theoria residuorum biquadraticorum. Commentatio secunda,” Art. 32 = Werke, 2, 
p. 104. 

46. denotantibus i, pro more quantitatem imaginariam ,/—1, ... “Theoria residuorum 
biquadraticorum. Commentatio secunda,” Art. 30 = Werke, 2, p. 102. Possibly, the firs 
appearance of i in print since Euler introduced this symbol was in Gauss’ Disquisitiones 
arithmetice, 1801, Art. 337 = Werke, 1, 1863, p. 414, where he stated scribendo brevitatis 
caussa i pro quantitate imaginaria ,/— 1 (for the sake of brevity writing i for the imaginary 
quantity ./ — 1). 

47 Tales numeros vocabimus numeros integros complexos [integros meaning complete], 
“Theoria residuorum biquadraticorum. Commentatio secunda,” Art. 30 = Werke, 2, p. 102. 

48 Hille, Analytic function theory, 1, 1959, p. 18. 
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Sirk WILLIAM ROWAN HAMILTON 
From Smith, Portraits of Eminent Mathematicians, Il. 


rate from geometry. In a paper written and read at the Irish Academy in 1833 
and 1835, but published two years later,*9 he differed from other authors 


in not introducing the sign —1 until he has provided for it, by his Theory 
of Couples, a possible and real meaning, as a symbol of the couple (0, ee 


Hamilton preferred to think of a complex number as a couple of real numbers 
(a, b).°' In the second part of his paper, “Theory of conjugate functions, or 
algebraic couples” (the part written in 1833, which starts on page 393; 76; 87), 


49 “Theory of conjugate functions or algebraic couples; with a preliminary and elementary 
essay on algebra as a science of pure time,” 1837. Page references are to the three references 
given in the bibliography in that order, separated by semicolons. 

50 End of the last footnote in the “General introductory remarks.” 

51 Actually, Hamilton worked with several kinds of couples: moment-couples, step- 
couples, number couples, limit-couples, logarithmic function-couples, etc. We shall concern 
ourselves with number couples only. 
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he set down the following basic definition [Art. 6, p. 403; 83; 95]: 


(b1, bz) + (a1, a2) = (b1 + 41, bz + a2); 
(b1, bz) — (a1, a2) = (b1 — a4, bz — ag); 
(b1, bz)(a1, a2) = (b1, bz) X (a1, a2) = (b1a1 — b2a2, bra) + bya); 
(bi, b2) _ (7 + b2a2_ boat — ). 
(aj, a2) ay + a5 ae + a5 


He added that these definition “are really not arbitrarily chosen, and that 
though others might have been assumed, no others would be equally proper.” 
Of course, the definitio of multiplication, for instance, must have been in- 
spired by what we want to obtain through the traditional manipulations: 


(by + boi) (ay + ani) = byay + boayi + byani + byayi? 
= bia, + (b2a, + bjaz)i — baz 
= bya, — byaz + (ba, + bj a2)i, 


where we have used the fact that i? = —1. 

Then Hamilton showed that addition and multiplication are commutative 
and that multiplication is distributive over addition. With these definition and 
laws, he may have started what would later be called modern algebra. 

Clearly, the couples of the form (a, 0) can be put in a one-to-one cor- 
respondence with the real numbers, and we may write, as Hamilton did, 
a = (a,0) [p. 404; 83; 95]. Then, with his definitio of multiplication, 
(0, 1)? = (—1,0) = —1, so that one can think of (0, 1) as representing the 
dreaded ./—1. Then we have 


(a1, 42) = (41,90) + 0,a) =a, + 0,1) =a, + aQv—-1. 
Or, to explain it in his own emphatic words [Art. 13, pp. 417-418; 93; 107], 
. and then we shall have the particular equation 


v (-1, 0) = ©, 1); 


which may ... be concisely denoted as follows, 
/—1 = (0, 1). 


In the THEORY OF SINGLE NUMBERS, the symbol ./—1 is absurd, and denotes 
an IMPOSSIBLE EXTRACTION, or a merely IMAGINARY NUMBER; but in the THE- 
ORY OF COUPLES, the same symbol ./ —1 is significant, and denotes a POSSIBLE 


194  ComPpLex NUMBERS CHAPTER 3 


EXTRACTION, Or a REAL COUPLE, namely (as we have just now seen) the prin- 
cipal square-root of the couple (—1, 0). In the latter theory, therefore, though 
not in the former, this sign /—1 may be properly employed; and we may 
write, if we choose, for any couple (a), a2) whatever, 


(a), a2) =a, +av—l, 


interpreting the symbols a; and ag, in the expression a) +a7./—1, as denoting 
the pure primary couples (a;,0) (az, 0),... 


This formulation as ordered couples really clears any metaphysical worries 
posed by complex numbers. It places these numbers on a solid foundation 
that the geometric interpretation could not provide. 

In 1837, Wolfgang Bolyai, and old university friend of Gauss, wrote him a 
letter chastising him for having propagated the geometric theory of complex 
numbers, while their foundation should be the real numbers, whose arithmetic 
was known. In his reply, Gauss agreed with Bolyai, and claimed that he had 
had the same idea and had regarded complex numbers as ordered couples since 
1831. While the world at large was readier to accept the geometric approach 
than Hamilton’s algebraic one, Bolyai and Gauss knew something that few 
others did at the time: that there are geometries other than Euclid’s. 

Ten years after the publication of Hamilton’s work on algebraic couples, 
Augustin-Louis Cauchy—the creator of complex function theory—stamped 
the seal of approval on complex numbers with his adoption of the letter i for 
./—1, approximately 300 years after these numbers originated. On the second 
page of a paper on “a new theory of imaginaries”®* he stated: 


There was no need to torture the spirit to seek to discover what the symbol 
./—1, for which the German geometers substitute the letter 7, could represent. 
This symbol or this letter was, if can so express myself, a tool, an instrument 
of calculation whose introduction into formulas permitted us to arrive more 
quickly at the very real solution of questions that had been posed. 


52 “Mémoire sur une nouvelle théorie des imaginaires, et sur les racines symboliques des 
équations et des équivalences,” 1847 p. 1121 = Gwvres, Ser. 1, X, p. 313. 


INFINITE SERIES 


4.1 THE ORIGINS 


In the preceding chapters we have seen Newton, Gregory, Euler, and others 
freely using infinit series. This terminology, as applied to the object of our 
study, was introduced in the last third of the seventeenth century. It probably 
appeared in print for the firs time in the second of Gregory’s Exercitationes 
geometrice, published in London in 1668. Here, on pages 10 to 12, we 
read “infinit seriei” twice and “series infinita four times, but Gregory had 
already used “series in infinitum the year before.! Newton already used 
Infinitam terminorum Seriem (infinit series of terms) in 1669, and Jakob 
Bernoulli later wrote a Tractatus de seriebus infinitis that became a classic on 
the subject. 

But medieval mathematicians, writing in Latin, used the word progressio. 
Among the progressions studied at that time, but whose origins go back to 
antiquity, there are three that have come to us with names that are still in 
use. In current notation, if a and r are constants, these are the arithmetic 
progression 

a, a+r, a+2r, a+3r, ..., 


the geometric progression 


2 3 
a, ar, ar’, ar’, ..., 


1 In Vera circvli et hyperbole qvadratvra, in propria sua proportionis specie, inuenta & 
demonstrata, 1667, pp. 34-39. 
? On the firs page of De analysi per equationes numero terminorum infinitas, 1669. 


E.A. Gonzalez-Velasco, Journey through Mathematics: Creative Episodes in Its History, 195 
DOI 10.1007/978-0-387-92154-9_4, © Springer Science+Business Media, LLC 2011 
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and the harmonic progression 


1 11 1 


i or 

The name of the last one is due to the interest that the Pythagoreans showed 
in music and to their realization that the pitch of a vibrating string doubles, 
triples, etc., as the length of the string is shortened to one half, one third, etc., 
of its original length. 

We have written these progressions as sequences (using the modern word), 
but of course there was an early interest in adding their terms and findin their 
sum. Already in an Egyptian papyrus, transcribed from an older work by the 
scribe Ahmoses c. 1650 BCE, there is a collection of 84 problems, of which 
the 79th simply states 


From Robins and Shute, The Rhind mathematical papyrus, 
an ancient Egyptian text, 1987, 1998. 


which can be translated as? 


The one scale Houses 7 
1 2801 Cats 49 
2; 5602 Mice 343 
4 11204 Spelt 2301 


Together 19607 Hekat 16807 
Together 19607 


On the right (left in the papyrus) we have a finit geometric progression, with 
a =r = 7 (Ahmoses wrote 2301 (1%) erroneously since 74 = 2401), and its 


3 The firs line ends with a 7 (Z), both in the translation and in the original, because the 
Egyptians wrote from right to left. Spelt is a hard-grain variety of wheat, and a hekat is a 
measure of grain just short of 5 liters. 
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sum: 
TAP AP AT 4 = 19607, 


The words written to the left of the terms of the progression have no interest 
for us and can be disregarded (but originally they may have been intended as 
part of a cute riddle, such as “each of seven houses has seven cats, each cat 
caught seven mice... ,” andso on). The most interesting part of this problem 
is the sum of multiples of 2801 in the left part of this problem. Where did the 
number 2801 come from, so conveniently chosen that seven (1 + 2 + 4) times 
this number is the sum of the progression? If we look at the previous sum, 


T+P+P+T7 = 2800, 


it shows that the original author of this problem knew that 19607 = 7(1 + 
2800), and suggests that he obtained the terms of the progression in this 
manner: 


7=7 
T+ 7 =714+7) =56 
74+74+7 =7(1 +56) = 399 
74+74+74+7 =7(1 +399) = 2800 
74+ 74+7RP474+P =7(1 +2800) = 19607. 


To us, who can use notation developed over millennia, this shows that if s,, 
denotes the sum of the firs terms of the progression, then s,4; = 7(1 + sy). 
That is, s, + 7’+! =7+7sp, from which 


Th 1 

Sn =7 7-1 
A general formula, containing this as a particular case, was actually demon- 
strated in Egypt, and has come to us from the pen of Eucleides of Alexandria 
(c. 325 BCE~—c. 265 BCE), usually known as Euclid in an English-language 
context. Euclid, the great compiler of all Hellenic mathematical knowledge up 
to his time, flourishe in the pampering conditions at the Museum of Alexan- 
dria and wrote at least ten scientifi works, most of which have been lost. His 
most famous work was The Elements, a work in thirteen books, of which the 
firs six served as the standard text in geometry for the next two millennia. 
Books VII to IX are devoted to arithmetic, and it is in this last book that we 

fin the following statement: 
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PROPOSITION 35. 


If as many numbers as we please be in continued proportion, and there be 
subtracted from the second and the last numbers equal to the first, then, as the 
excess of the second is to the first, so will the excess of the last be to all those 
before it. 


It is not difficul to translate Euclid’s verbal description into a form that 
is familiar to us and would have been so foreign to him. What Euclid says is 


that, ifa,ar,ar?,ar?,...,ar"*! isa finit geometric progression, then 
ar—a ar"! _@ 
a a+ar+ar? +ar3 +-+--+ar" 


which, if we denote the denominator on the right by s,+1, is equivalent to 


pot as | 


r= 


Sntl =a 


But we are more interested in infinit sums. The firs infinit series ever 
summed was 
ie ty + +(7) + 
4° \4 4 


a feat accomplished by Archimedes of Syracuse (c. 287 BCE—212 BCE) in 
the process of performing the quadrature of the parabola, that is, of findin the 
area of a parabolic segment such as Q Pq in the next figure Archimedes, who 


Q 
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There is no existing portrait that beyond a doubt represents Archimedes. 


had spent quite some time at Alexandria and can be considered a member 
of this school, was one of the greatest mathematicians of all times if not the 
greatest. His life, on return to Syracuse, was entirely devoted to mathematical 
research and to the development of some mechanical inventions. He summed 
the series stated above in a series of propositions that he included in a letter to 
Dositheos of Pelusium, probably at Alexandria. We shall follow the translation 
by Heath.4 

By the segment QPg in the previous figur we mean the area between the 
arc of parabola QPq and the chord Qq. Archimedes chose a point P so that the 
tangent to the parabola at P is parallel to Qqg and then considered two points, 
R and r in Heath’s notation, located on the parabola at the points where its 
tangents (not drawn) are parallel to the chords PQ and Pg, respectively. Then 
he proved, in Proposition 21 [p. 248], that the sum of the areas of the smaller 
triangles PRQ and Prq is one quarter of the area of the triangle Q Pq. 


4 In The works of Archimedes with The method of Archimedes; reprinted by Dover, 1953, 
pp. 233-252. Page references are to this reprint. 
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Next he drew, in each smaller parabolic segment left over (such as the 
one subtended by the arc from P to R) another triangle whose base is the 
chord joining the endpoints of the arc (for instance, the segment PR) and with 
a vertex located on the subtending arc at the point where the tangent to the 
parabola is parallel to the chosen base (for instance, at the point where the 
tangent to the parabola is parallel to PR). As before, the sum of the areas of 
these smaller triangles with bases PR and RQ is one quarter of the area of the 
triangle PRQ. Continuing in this way, he “exhausted” the original parabolic 
segment.° In our terms, if A denotes the area of the triangle Q Pq, then the 
area of the parabolic segment Q Pq is the sum 


asta+(f)at +(7) 4+ 
4 4 4 


But of course, Archimedes did not express himself in this way. Instead, he 
evaluated firs a partial sum of this series as follows [p. 249]: 


PROPOSITION 23. 


Given a series of areas A, B, C, D, ..., Z, of which A is the greatest, and 
each is equal to four times the next in order, then 


APB+ CHD Hert 24g Z = 5A, 


Translated into our notation, this means that for any n, 


asjas(t) at +(7) 445(4) 4=54 
4 € (Gj 5c a al 


oe eee ee 


It is clear, from our present point of view, that as n grows indefinitel , 


1+34(t) 4 +(7) + =e 
A” VA 4 3) 


5 This method of exhaustion to fin areas, which Archimedes perfected and used countless 
times, was introduced by Eudoxos of Cnidos (408-355 BCE). 


so that 
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which yields the sum of the geometric series for r = i. Without disregarding 
the firs area A of the triangle @ Pg, Archimedes phrased his conclusion as 
follows [p. 251]: 


PROPOSITION 24. 


Every segment bounded by a parabola and a chord Qq is equal to four-thirds 
of the triangle which has the same base as the segment and equal height. 


But a long time would pass before the general formula for the sum of the 
geometric series was obtained, and this was accomplished firs in India by 
Nilakantha Somayaji (1444-1545). A native of Sri-Kundapura (Trkkantiyur 
in the local Malayalam language) in Kerala, on the southern coast of India, 
he is chiefl known for his astronomical work Tantrasangraha (A digest of 
scientifi knowledge), c. 1500, but his masterpiece is the Aryabhatiyabhasya, 
a commentary on the Aryabhatiya of Aryabhata.® Here [1930, p. 106] he 
stated and proved the sum of an infinit geometric progression: 


Thus the sum of an infinit series, whose later terms (after the first are got 
by diminishing the preceding one by the same divisor, is always equal to the 
firs term divided by one less than the common mutual divisor.” 


In other words, 
a a 


a a 
Fr F 


yn r—1 


European scholars were unaware of these developments in India, and 
Francois Viéte rediscovered this formula in 1593.8 First he stated a theo- 
rem about the sum of a finit number of terms, but without any notation. Next 
he gave an explanation using notation, as follows: 


Let there be magnitudes [numbers] in continued proportion, of which D is 
the largest, X the smallest, & the composition [sum] of all of them is F’,, and 
let the ratio of a larger to the next smaller term be as D is to B. I say that D 
is to B, as F minus X is to F minus D. 


8 Sastri, Aryabhatiya with the bhashya of Nilakantha, 1930, 1931. 

7 Quoted from Sharma, Hindu Astronomy, 2004, p. 222. 

8 In Chapter XVII, entitled Progressio Geometrica, of Variorvm de rebvs mathemati- 
cis responsorvm, liber VHT, 1593 = Schooten, Francisci Viete opera mathematica, 1646, 
pp. 397-398. The theorem and equations discussed here are from page 397. 
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With these words Viéte referred to the progression 
D+B+---+X=F 
(notice that this is the proper order of its terms, since they must decrease from 
left to right for it to have a finit sum), and then concluded that 
D_F-X4, 


h” Pod: 


Then Viéte drew four additional conclusions, of which the firs was 
Itaque dats D, B, X, dabitur F. Enimvero semi Die equabitur F, 


This equation has been reproduced photographically to be true to the original, 
but it contains a typo. The fraction’s numerator should start with D7, not B?, 
and then the equation on the right should be 


D? — BX 
__ =F 
pes 


’ 


as is easy to deduce from the previous equation. 
Next, Viéte considered the case in which the number of terms goes to 
infinity 


In truth when the magnitudes are in continued proportion to infinit , X goes 
away to nothing. 


This means that the term BX can be considered zero when the number of terms 
is infinite and we obtain 
D2 
D=B 
where F is now the sum of the series. If we write D = a and B = ar, in the 
notation established above, we obtain the sum of the geometric series as 


— F,10 


a 


_—. 


° He gave no proof of this, and none is necessary because it follows immediately from 
Euclid’s formula (in its present-day equivalent, just replace a with D, r with B/D, ar” 
with X, and the sum in the second denominator with F). 

10 Viate stated this equation in the form (D—B)/D = D/F with the following words: “Vt 
differentia terminorum rationis at terminum rationis majorem, ita maxima ad compositam 
ex omnibus.” 


SECTION 4.2 THE SUMMATION OF SERIES 203 


The harmonic series did not fare as well. Using the modern word, it is 
divergent. The firs to give a written proof of this fact was Nicole Oresme 
(c. 1323-1382), a professor at the Collége de Navarre in Paris who would 
later become dean of the Rouen Cathedral and Bishop of Lisieux in Normandy. 
About 1360, he gave the following argument, which we write firs in modern 
symbols, in Questiones super geometriam Euclidis: 


ioe hea 
2 3 4 5 6 7 8 


Clearly, the sum on the right is not finite Oresme’s original argument was: "' 


Add to a magnitude of | foot: 5s is i foot, etc.; the sum of which is infinite 
In fact, it is possible to form an infinit number of groups of terms with sum 
greater than 5. Thus, 7 + 7 is greater than 5 [quia 4° et 3 sunt plus quam 

. wu 1 dp Wyo Te dg So 1 
una medietas], sumnllatly st et t+, isgreaterthan >> 5+ 79+ 77+: +76 
is greater than 5 and so in infinitum. 


4.2 THE SUMMATION OF SERIES 


Gottfried Wilhelm Leibniz (1646-1716), a lawyer turned diplomat, arrived 
in Paris in 1672 on a (failed) diplomatic mission on behalf of the elector 
of Mainz. When in Paris, where he stayed until 1676, he met the Dutch 
scientist Christiaan Huygens (1629-1695) and developed an interest in math- 
ematics. Huygens was Leibniz’s mentor in Paris and, possibly wanting to test 


"1 Quoted from Struik, A source book in mathematics, 1200-1800, 1969, p. 320. Note 
also that Oresme did not write fractions as we do. The inset in Latin illustrates his original 
writing. 
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the ability of the young man, proposed to him the following problem in 1672 
[pp. 14; 404; 49; 16]:" 


To fin the sum of a decreasing series of fractions, of which the numerators 
are all unity and the denominators are the triangular numbers; of which he 
[Huygens] said that he had found the sum among the contributions of Hudde 
on the estimation of probability. 


Triangular numbers are so called because you can form a triangle with as 
many dots as each number, as shown in the next figure Thus, the triangular 


numbers are 1, 3, 6, 10, 15,21, ... or, as we would express this today, those 
of the form n(n + 1)/2 for each positive integer n. 

Leibniz found the answer to this problem very quickly, and it was in agree- 
ment with the one that Huygens already had. He wrote the result in a 1674 
manuscript never published during his life, entitled Theorema arithmetice 
infinitorum,'? firs in narrative form and then as follows: 
re 


| Ti lijzla 1 , 1 ni 4 
Series © 7 +q+g+p+Rta ete. ininfinitu nm 2. 


Two of these symbols need an explanation. Leibniz used M as an equal 
sign, but the symbol © is just a marker that he used instead of numbering 
the equation, and has no mathematical meaning. Thus, Leibniz’ statement is 
that the sum of the series © is 2. Of this fact he included a demonstration, 
which begins by considering two more series. First the one that we have called 
harmonic: 


12 This is as Leibniz himself recalled the problem in his manuscript “Historia et origo 
calculi differentialis.” This quotation is from the printed references given in the bibliography, 
in this order. 

13 This extract is from f. 1” of the manuscript. 
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Series ) t t 5 aie ian es Va 7 etc. in infinitum 


Then another whose terms are half the terms of ©: 


Ob ope lle eS ge, A ah 
6 9-2 20 F304 


5 etc. in infinitum 


Series 8 54 q 


This series, he asserted, has sum 1. He began his proof by subtracting ¢ from ) 


4,9 , 16, 25 , 36 
2 + 36 + 80 + T30 + 352 


to obtain 5 etc., which, after simplifying, becomes 


SthtGt+it+it4 ete. in infinitum 


series 9 


That is, Leibniz had shown that ) — % = 9, and then he pointed out that 
g@ = )—1. It follows that § = 1 and therefore twice the series §, or the 
series ©, equals two. In his own words: 


Ergo dupla series & sive series © erit eequalis binario. 


Today’s readers may be shocked by Leibniz’ use of divergent series with 
such flagran disregard for the appropriate mathematical etiquette. He thought 
nothing of it, and neither did most of his contemporaries. They were used to 
rigor in geometry, where it belonged, but what many other mathematical field 
wanted was exploration. Obtaining results was what counted, and the niceties 
of mathematical rigor would have to wait for a better time. In this particular 
case, what counted is that the sum of the series in question is actually 2. 
However, modern presentations of Leibniz’ result are usually “sanitized” in 
the following manner: 


2°6 12° 20° 30° 


-2[(-3)+G-3)+G-a)+G-s)+G-a) 
7 3 2 3 3 4 4 5 5 6 
which, after all the obvious cancellations, equals 2. 

After this result and the invention of a calculating machine that could 
perform addition, subtraction, multiplication, division, and the extraction of 


roots, Leibniz went on to achieve immortal fame in mathematics with his 
development of the calculus. 
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However, he failed to solve a problem in infinit series that had originally 
been posed to him by Henry Oldenburg, in a letter of April 1673: '* to sum the 
reciprocals of the squares of all the natural numbers, a task at which Pietro 
Mengoli had already failed. A little over one month later, Leibniz confessed 
that he had not yet found the sum.'® He never found it. 

Next to succeed at summing infinit series was Jakob Bernoulli (1655— 
1705), of Basel, who included some known results—such as the sum of the 
geometric series, the divergence of the harmonic series, and the preceding 
series summed by Leibniz—in his Tractatus de seriebus infinitis, which was a 
collection of fi e papers, written between 1689 and 1704, published as an ap- 
pendix to his Ars conjectandi,'® all put together by his nephew Niklaus. In this 
appendix, Bernoulli also set out to sum some new series with a considerable 
degree of success, but the procedure was essentially the one already worked 
out by Leibniz and illustrated in the preceding equation: the decomposition 
of each term into a difference and the cancellation of most of the terms. Then 
he attempted to sum the reciprocals of the squares of the natural numbers. 
Bernoulli made some progress, for he realized that each term of this series is 
smaller than the corresponding term of the series that Leibniz had summed, 
and therefore this series must also have a sum. But as for findin it, Bernoulli 
had to admit that [p. 254] 

when [the denominators] are purely Quadratic, as in the series 

aa oe ee 
eg ge a5 
it is difficult more than one would have expected, to search for its sum, ... 


So he could not fin the sum, concluding the paragraph with the following 
plea [p. 254]: 


If someone discovers and communicates to us what has eluded our efforts 
thus far, he will earn our deep gratitude. 


Much later, Euler started with some numerical approximations and found 
the approximate value 1.644934 for the sum of the series [Art. 22],"” but this 


Me Gerhardt, Der Briefwechsel von Gottfried Wilhelm Leibniz mit Mathematikern, 1899, 
p. 86. 

15cumman nondum fateor reperi,” Gerhardt, Der Briefwechsel von Gottfried Wilhelm 
Leibniz mit Mathematikern, p. 95. 

16 Page references are to the 1713 edition. 

17 “De summatione innumerabilum progressionum,” 1730/31, pub. 1738. References to 
this and the next paper by Euler are by article number. 
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JAKOB BERNOULLI IN 1687 
Portrait by his brother Nikolaus. 


figur was not sufficien to make him guess the exact answer. However, in 
1735, he was able to report his success in obtaining the exact value as follows 
[Are 2:1 


Recently, however, and altogether unexpectedly I have found an elegant ex- 
pression for the sum of this series 
1 1 


1 
I$o4—4— dete, 
ae Gee 


which depends on the quadrature of the circle... In fact, I have found that 
six times the sum of this series is equal to the square of the circumference of 
a circle whose diameter is 1, ... 


In current terminology, this means that the sum of the series is 77/6. If we 
wonder about the strange description of Euler’s result in words, note that at 


18 “De svmmis seriervm reciprocarvm,” 1734/35, pub. 1740. 
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that time there was no universally accepted symbol for 7.9 In 1735, Euler 
used p instead of z, and, because of the lack of a standard symbol, he must 
have thought it appropriate to state the result in unequivocal words. However, 
it was Euler’s use of the letter zr, starting in 1736,2° that made this symbol 
popular. 

Euler began with the series for y = sin s (see Section 4.3), 


s s» s’ 


= + etc. 
mr aa ee oe ee ee 
[Art. 3], and then found the sum of the stated series [Art. 11] by a long method 
that we shall not present here. Fortunately, he then did it again by a shorter 
method that starts by putting y = 0 and dividing by s if s #0 [Art. 16]: 


s? s4 s® 


0=1 te. 
[2.3 (20.8406 14-5 
The sine of s, and therefore this series, vanishes for s = +kz (we shall use z 
while Euler still used p). Therefore, the right-hand side can be written as the 
product of the factors 


Ss 
1-— 1 >» 1 » 4 , tc., 
XT ee 20 20 ae 


and combining them two by two we have 


Ce st s 


1 } + etc. 
i058." (ea Teese 


2 2 2 2 
= ‘fi 1 1 1 te. 
( =)( all =)( =) = 


From this Euler concluded [Art. 17]: 


It is now clear from the nature of this equation, that the coefficien of ss or 
733 is equal to 


19 The symbol for 3.14159 ... was introduced by Sir William Jones in his Synopsis 
palmariorum matheseos, 1706, p. 236, but had not yet become popular. 

20 Starting in Mechanica sive motvs scientia analytice exposita, 1736, vol. 1, p. 119, with 
the statement “denotante 1:2r rationem diametri ad peripheriam” = Opera omnia, Ser. 2, 1, 
1912, p. 97. 
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In short, 
\ el 1 ; Se 
Pat gvqg = 6 
Upon learning of this result, Johann Bernoulli (1667-1748) wrote: “if only 
my brother were alive.” *' In the same paper [p. 133] Euler found the sum of 
the p-series for p = 4, 6, 8, 10, and 12 [Art. 18], and later, in 1740, he found 
many more sums by similar methods.24 
Euler was 23-24 years of age when he obtained his firs results on the 
summation of series, but he would be from this moment forever to be known 
in the fiel of mathematics. However, in a letter to Euler of April 2, 1737, 
Johann Bernoulli pointed out some defects in the preceding method, which 
Euler admitted in his reply of August 27, 1737. Also, Niklaus Bernoulli 
(1687-1759), a nephew of both Jakob and Johann, wrote him a letter in 1743 
to point out that in expanding an infinit series into a product using its roots, 
he was treating the series as if it were a polynomial, a criticism that was well 
founded. But Euler was not to be distracted from his path, for he believed in 
the validity of his result even in the absence of proof. He devoted Chapter X of 
the Introductio to the sum of infinit series, including once more the p-series 
previously obtained, on the basis of the following statement [Art. 165]: 


If we have 14+ Az+ Bz? 4+ C234 Dz + &c. = (1+ az)(1 + B21 4 
yz)(1+6z) &c., these Factors, whether they are finit or infinit in number, 
if they are actually multiplied by each other, must produce that expression 
14+Az+Bz2?+Cz3+ Dz*+ &c.. Therefore, the coefficien A will be equal 
to the sum of all the quantities w+ B+ y+6+¢€+&c.. The coefficien 
B will be equal to the sum of their products two at a time, and it will be 
B=aB+ay+aéd+ By + 66+ yd + &c.. Then the coefficien C 
will be equal to the sum of the products three at a time, namely it will 
be C = aby +.aB5 + Byd +ay5 + &c. And soon....2% 


21 The original statement, “utinam frater superstes esset”, is in Johannis Bernoulli opera 
omnia, 1742, vol. 4, p. 22. 

22 “De seriebus quibusdam considerationes,” 1740, pub. 1750. 

23 A similar result for the roots of a polynomial had already been given by Newton in his 
Arithmetica universalis, sive de compositione et resolutione arithmetica liber, 1707. This 
was written as a text for Newton’s lectures from 1673 to 1683. The statement in question is 
on page 20 of the English translation by Raphson as Universal arithmetick, London, 1720, 
which can also be seen in Struik, A source book in mathematics, 1200-1800, p. 94. Also in 
Whiteside, The mathematical papers of Isaac Newton, V, p. 359. Newton did not provide 
a proof, and, in fact, no proof was yet available at the time that Euler’s Introductio was 
published. 
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Then he define [Art. 166] 


P=a+Pt+y+é+e+&e, 
O=a?+ pr+y?24+04+ +4 &e. 
R=0C4+64y4+H8+e4 &. 
S =ot+pt+y44+ 8444+ &. 
&e. 
and asserted that 
P=A 
O = AP—2B 


R = AO— BP +3C 
S =AR—BO+CP—4D 
&e. 


“the truth of which formula is easy to know by simple inspection: until it is 
proved with maximum rigor in the differential calculus.” Dispensing then 
with a proof, as Euler did, we shall be content with a simple inspection of the 
second equation in the last quartet. If we change Euler’s notation a, B, y,... 
to a1, @2, 03, ..., So that we can indulge in the use of friendly subscripts, and 
if we write 


A=P=)> wi, B=) aja; and =) w, 
i<j 
it is possible to believe that 


AP = (doi) (Soa) = So af +2) ¢ aja; = Q0+4+2B, 
i<j 
so that OQ = AP — 2B. 
Accepting Euler’s equations on this narrow basis, we now return to the 
basic identity 24 


s? s4 x? 


\ ‘ 
ies ides 1-2-3-5-7 


s? s? s? s? 
Ad (lk 1 1 1 &C. 
( =) 7=)/( =3)( =) . 


24 Euler did not do this, but began Art. 167 with another identity that he had proved in 
Art. 156. We shall stay with his older expansion to avoid proving the new one, but otherwise 
will proceed exactly as he did. 


+ &c. 
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2 


and make the substitution s2 = z to obtain 


m2 2 m4 4 7° 6 
1 | L &c, 
12925 "TSS 1.9535-5-7 . 


=(1 1 - (: 7) 1 Ja 
= ( 2 3) 52) ( 16° Cc. 


“Therefore, applying the rules found above to this case, it will be” 


ie n4 ad ‘a 


’ ’ C — fi ’ D= 
6 120 5040 362880 


Therefore if we put 


1 
P=-l ae ey. &e. 
42 92 «162 
R=-l : &e. 
4 99 163 
S = 1 t T T + &C 
44 94 164 
we obtain 
Lt p+ ap tec.=O=AP-28=4?-28= 2 — = = al 
34 36 = 60 90 
lbs 1 wee 1 4 &. = —R=—-AQ+BP—3C = moe : aa = mo 
36 540 720 5040 945 
and 
1 1 
1+ 53 +35 +&.=S=AR-—BO+CP-—4D 
8 8 8 8 8 


1s 4 IU 4 4 
T 


~ 5670 10800 | 30240 90720 9450. 
Euler concluded [Art. 168]: 


It is thus clear that all the infinit Series contained in this general form 


14 + &C., 
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every time that n is an even number, can be produced by means of the Pe- 
riphery of the Circle zr; and in fact the sum of the Series will always have a 
rational ratio to zr”. 


After this he gave the sum of the p-series for all even values of p up to p = 26. 

However, while he was able to give approximations to the sum for some 
odd values of p in his 1740 paper [Art. 29], Euler was unable to sum a p-series 
for p odd. Nobody has ever been able to sum such a series. 


4.3 THE EXPANSION OF FUNCTIONS 


Obtaining the sums of particular series of numbers is of relatively little interest 
in comparison with expanding a function in a series whose terms depend on 
the variable x. Such series can be used to compute values of the expanded 
functions, such as in the case of the binomial theorem and the expansion of 
the logarithm, both already covered in Chapter 2, and obtained by various 
ingenious methods in the 1660s. 

As for the series for the sine function, which we saw Euler using in the 
preceding section, it was firs obtained by Madhava of Sangamagrama—the 
present-day Irinjalakuda, in Kerala—who lived from about 1340 to 1425. His 
writings on this and other infinit series have not survived, but they are quoted 
in two works. One is the Yuktibhasa (An exposition of the rationale), c. 1550, 
of Jyesthadeva (c. 1500-1575) and the other is the anonymous Tantrasan- 
grahavyakhya, a commentary on Nilakantha’s Tantrasangraha written by a 
student of Jyesthadeva, probably while his teacher was still alive. Both works 
contain statements for the series for the sine, cosine, and arctangent, but only 
the Yuktibhasa shows the demonstrations. 

However, the statements in the Tantrasangrahavyakhya are important be- 
cause they contain a second version of the sine series, which was quoted 
in Nilakantha’s Aryabhatiyabhasya and attributed to Madhava.*° It can be 
assumed, then, that the versions in the Yuktibhasa and the Tantrasangra- 
havyakhya are also by Madhava. The statement of the sine series in this 
last work, both in Sanskrit and in English translation, is as follows.”6 


25 See Sastri, Aryabhatiya with the bhashya of Nilakantha, No. 101, 1930, p. 113. 

26 These are taken from Rajagopal & Rangachari, “On an untapped source of medieval 
Keralese mathematics,” 1978, 95-96. The authors based their translation on an original 
manuscript in the Government Oriental Manuscript Library at Madras. This article also 
contains Nilakantha’s version in the Aryabhatiyabhasya, p. 97. 
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(iS paqitoy NG adeRathi-a 


Rasa gor yomanee: RANA. 
AG Bea ATMNSEN SENIZIR Ser, 
Sorer xrrais art fag ayRa nee | 


The arc is to be repeatedly multiplied by the square of itself and is to be 
divided [in order] by the square of each even number increased by itself and 
multiplied by the square of the radius. The arc and the terms obtained by 
these repeated operations are to be placed in sequence in a column, and any 
last term is to be subtracted from the next above, the remainder from the term 
then next above, and so on, to obtain the jyva of the arc. It was this procedure 
which was briefl mentioned in the verse starting with vidvan. 


Since we are temporarily back to sixteenth-century mathematics, we must 
explain a statement made in narrative form. Ifs denotes the arc corresponding 
to a central angle @ and R is the radius, the firs sentence refers to the terms 


2 2 2 


S S S 


2 42)R2 * Q242)R2) 42 44)R2 


5, 


and so on. The rest of the statement means that we have to alternately add and 
subtract these terms to obtain the jyva of 0. That is, 


a s° s! 


6R2 ' 6R2-20R2  6R2-20R2 - 42 R2 


jyad=s tere, 
and replacing jya@ with Rsin@, dividing by R, and writing s = R@, this 
equation becomes 
a.  @? 
Se Fa ag 
which is the familiar infinit series for sin 0. 

These discoveries could have reached Europe through many Portuguese 
explorers and mathematically savvy Jesuit missionaries that were in Kerala in 
the sixteenth century. But if Indian mathematics was imported into Europe at 
that time it made no significan impact. European mathematicians carried on 
as if they knew nothing about it, which is probably the case. They rediscovered 
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Madhava’s series more than 250 years after they were originally proved, and 
then they found new series expansions. 

Newton found the following expansion for arcsin, among others, “in winter 
between the years 1664 & 1665,” which he did via integration and the binomial 
theorem as in the case of the logarithm.?’ If z denotes what we call arcsin x, 
then ae 

xX xX X 
z=x 6 + 7 tT &e. 
Newton restated this result, exactly in this way, in De analysi, along with 
the series for the sine and cosine, which were new to him [pp. 17; 236— 
237].78 John Collins, librarian of the Royal Society, received a copy of this 
work through Barrow, Newton’s colleague at Cambridge, and communicated 
these results to James Gregory (1638-1675), of the universities of Padua 
(1664-1668), Saint Andrews (1668-1674), and Edinburgh (1674-1675), on 
24 December 1670 [p. 155; 54].79 

In his response of 15 February 167 Gregory said: “I thank you for the 
serieses you sent me, & send ye these following in requital” [p. 170; 62]. Two 
of these are, in current notation (they will later be seen in Gregory’s), 

6? 26° «1707 
tand =@+ —+ + 


1,20 


a eee 
and 

en 

ee eT a A” 700 


27 Reproduced in Whiteside, The mathematical papers of Isaac Newton, I, 1967, pp. 108— 
109. The result is not explicitly written in the original as it is here. Columns | and 3 of the 
table on page 109 must be multiplied to obtain it. Note Newton’s trivial error on page 108, 
as explained in Whiteside’s footnote. 

28 The firs page refers to Jones’ edition, and those after the semicolon to Whiteside’s 
translation. 

29 Any double reference to Gregory’s work or correspondence in this chapter is to the works 
by Turnbull, James Gregory tercentenary memorial volume, 1939 and The Correspondence 
of Isaac Newton, I, 1959, in this order. If only one page reference is given, it is to the 
memorial volume. Most of Gregory’s papers were lost or put away and ignored soon after 
his death, and thus the man who was second only to Newton in the early 1670s was almost 
forgotten, and his very important contributions to mathematics were not rediscovered until 
the publication of Turnbull’s memorial volume. It contains reproductions and translations 
of Gregory’s manuscripts, a summary of the contents of his books, and Turnbull’s extensive 
researches and interpretation. In this and the next section, I rely heavily on Turnbull’s 
interpretations. 

30 For the sake of authenticity, dates of letters and manuscripts mentioned in the rest of 
this book are given in the style in which they were originally written by their authors: small 
unit (day), middle unit (month), and large unit (year). 
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JAMES GREGORY 


His only known portrait, at Marischal College. 
University of Aberdeen. 


Gregory did not explain how he obtained the expansions. But findin out is 
important because it will turn out that while previous series expansions by 
others had been found by disparate methods, these were obtained by a general 
procedure that has been in use ever since. This method can be obtained on 
the basis of a very valuable interpolation formula firs found by Gregory and 
communicated to Collins in a letter of 23 November 1670. The same kind 
of work was carried out by Newton, independently, in his 1676 manuscript 
Regula differentiarum. It was published in 1711 with the title Methodus differ- 
entialis (Method of differences), but a modifie version had already appeared 
in Lemma V of Book III in the Principia.*" 

The function concept and the now usual notation for functions did not exist 
at the time of this discovery, but it will add clarity if we present it in modern 


31 Philosophie naturalis principia mathematica, 1686, pp. 481—483 = Motte’s translation 
The mathematical principles of natural philosophy, 1729, pp. 333-336 
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notation and terminology in addition to the original notation by Gregory. This 
will allow an easy comparison of the main result of the next section with 
its usual statement at present. Assume then that c is a positive constant and 
that y = f(x) is a function whose values are known at the points of an evenly 
spaced collection of points x; = ci,i = 0,...,. Our problem is to determine 
the values of f at every other point x or, as Newton put it, Given any number of 
points, to describe a curve which shall pass through all of them.** To present 
the solution, we defin the first and higher-order increments 


Af (xi) = fQi +0 — fi) 

A? f (xj) = Af (ai +0) — Af (i) 
A? f (xj) = A? f (i +0) — A? fi) 
Af f (xi) = AB fai +0) — 3 f(a) 


But instead of using these, Gregory, in his letter to Collins, and in reference 
to a figur like the next one,?? said [p. 119; 46]: 


A 


I oa O P 


In the end of my Geometrical exercitations I fail exceedinglie ... and hence 
in place of any thing I have described there; putting AP = PO = c, 
PB=d,... 


Thus, with the origin at A and the direction of the positive x-axis to the left, 
d = Af(O) = f(c) — f(). Then he define the higher-order increments in 


32 “Datis quotcunque punctis Curvan describere que per omnia transibit.” in Whiteside, 
The mathematical papers of Isaac Newton, IV, pp. 60-61. 

33 Fig. 8.%, Exercitationes geometric, 1668. I have omitted some inscribed rectangles 
that are irrelevant in this context. 
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verbal form, denoting them by f (“primam ex differentiis primis”), 4, i, and 
so on, which in our notation can be expressed as 


f =A’ f() = Af(c) — Af (0) 
h = A> f (0) = A’ f(c) — A? f (0) 
i= A*f£(0) = A? f(c) — AFF (0) 


With all this, he was able to state his result as follows [pp. 119-120; p. 46]. 


In the eighth figur of my exercitationes imagine any straight line [segment] 
Aa in the straight line AJ to which wy is perpendicular, with y on the curve 
ABH [the segment ay has been added above to the original figure] which 
remains as before, consider the infinit series 

a a-—c a-—2c a-—3c 


; ; > etc., 
c 2c 3c 4c 


let the product of the firs two terms of the series be b/c, that of the firs three 
k/c, that of the firs four //c, that of the firs fi e m/c, etc., to infinity then 
the straight line [segment] 


ad bf kh ili 
t t t + etc. 
Cc 


ay= 
c ce c 


to infinit . 


To interpret the last equation replace the quotients b/c, k/c, l/c, ... with 

the products that they stand for and then replace d, f, h, i, ... with the 

expressions given above. Then, since wy = f(a) in current notation, where a 

is the variable, we have 

a(a—c) 5 a(a — c)(a — 2c) 

——— A* f (0) + ————_ 
2c FO) c-2c-3c 


Cc: 
a(a — c)(a — 2c)(a — 3c) 
c:2c-3c-4c 


f@= —Af) an A} f(0) 


A‘ f (0) + ete. 


This is Gregory’s formula,** and it can be considered only an approxima- 
tion if a is arbitrary. Of course, if f(0) # 0 then we can add f(0) to the 
beginning of the right-hand side. 


34 Newton did not originally state a formula. In 1676—in what may be considered amemo 
to himself rather than a paper—he just gave a table of divided differences (our increments 
divided by Gregory’s c), hoping perhaps that the rest would be obvious. But his notation 
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We can further generalize the formula by replacing 0 with a; a witha +h 
in the left-hand side; and any a in the numerators of the right-hand side, where 
it has the meaning a — 0, with h. If we also replace c with Ax, it becomes 


Af(a)  hh=AnN ML’ f@ 
1-200 Axe 5 | 


flath)= f@th 


It must be pointed out that this formula can be used to obtain the binomial 
theorem by considering the special case 


mee 
f@= +2), 


a function whose values are known at x = 0, 1,2,... In this case, f(0) = 1, 


d d 

A at ae a a ras aa 

A? f(0) = Af + 1) — Af) = fF +2) — FO +1) — Af) 
(1447 _(44.9)_22 
=( i) ( 5) bb 


and so on. In this manner, putting a = 0, 4 = a/c, and Ax = 1, 


aja aja a 
2a eee see | ee, 
a\“l* d ( a ( )( )e 
(1+ 2) =145548 —— s 


2 Pp ere: aes 
This is the binomial theorem, but we have worked it out in current notation. 
In an enclosure sent with his letter to Collins of 23 November 1670, Gre- 
gory posed a problem and then gave its solution by what is essentially rec- 
ognized as the last equation, but did not explain how he arrived at it. It is a 


is very foreign to modern tastes. Suffic it to say that he denoted his x-values by A + p, 
A+q,A+r..., the function values at these points by a, 8, y ..., the first-orde divided 
differences by ¢, 7, 0..., the second-order divided differences by A, w..., the third-order 
divided differences by € ..., and so on. Pity he had not discovered the use of subscripts. 
In 1711 Newton included some explanations from which the formula can be reconstructed. 
This was done—in Newton’s own notation—by Whiteside in The mathematical papers of 
Isaac Newton, IV, pp. 63—64 and VIII, p. 248. However, the curious reader is warned 
that a certain amount of work would still be required to transform Whiteside’s formula 
into the one stated here. There is a formula in the Principia’s version: “RS will be = 
a+bp+cq+dr+es+ ft, + &c,” but it also requires careful interpretation. 
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permissible conjecture that he may have obtained this result from his interpo- 
lation formula as shown above, but in his own notation. Here is a translation 
of Gregory’s Latin passage [pp. 131-133]: 


To fin the number of a logarithm. 


Given two numbers, the firs b, the second b + d, let the logarithm of b be e 
and the logarithm of b + d be e + ¢, it is desired to fin the number whose 
logarithm is e + a. 


Consider a series of continued proportions, 


dz a 


and another series 


a a-c a—2c a—3c 

2c 3c 4c 
let the product of the firs two terms of this series be f/c, that of the firs three 
g/c, that of the firs four h/c, that of the [first fi e i/c, & c.; the number 
whose logarithm is e + a will be 


&C.; 


ape ad , fad’ | ed? ) hd4 | id? ) kd® - &o.35 

c cb cb? cb? cb* cb 
Hence with some work but no trouble any pure equation whatever may be 
solved. 


If the last equation does not look like the binomial formula stated above, it is 
a matter of moments to recognize it as such. Just replace f/c to k/c with their 
definitions divide throughout by b, and the last series becomes 


is ad | a(a —c) d* | a(a~c)a-2c)d” |g 
cb c-2c b? c:-2c-3c b3 


which, as we have seen above, is the binomial series of 


d a/c 
1 = é 
(+5) 


35 Actually, Gregory omitted the factor d/b in several of the terms, but corrected the 
mistake in a new letter of 19 December [p. 148; 50]. 
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Thus, the last series in the quotation from Gregory corresponds to the product 
of this expression by b. The logarithm of this product is, using Euler’s notation, 


I(b) + [Ib +d) —1b)] =e + (e+e —e) =e +a, 


as stated by Gregory. Thus, we see that Gregory essentially discovered the 
binomial series in 1670 independently of Newton. He knew what he had in 
his hands, for he gave an example to fin the daily compound interest rate 
equivalent to 6% per annum [pp. 131—133], that is, what we now express as 


1 


100{ 1 Ve 
(i+2)". 


Then in a letter to Collins of 19 December he used the binomial theorem to 
perform several term-by-term integrations, giving the area under a circle, the 
expansion of the arcsine, and the length of the curve y = r log x [pp. 148-149; 
50-51]. Of course, Newton put the binomial theorem to greater use, not only 
in the case of the quadrature of the hyperbola, but also making it an essential 
tool in his development of the calculus, as we shall see in the next chapter. 


4.4 THE TAYLOR AND MACLAURIN SERIES 


Returning now to Gregory’s interpolation formula, essentially the same result 
was obtained by Brook Taylor (1685-1731), who had been secretary of the 
Royal Society from 1714 to 1718, and is contained in a letter dated 26 July 
1712, to John Machin (1680—1751),°° professor of astronomy at Gresham 
College. Then he published it in 1715,°” giving proper credit to Newton’s 
work in the Principia, and including a “demonstration” and two corollaries. It 
was in the second corollary, after the proposal: “If we substitute for evanescent 
increments the fluxion proportional to them,” that he obtained the famous 
Taylor series, as it was called by Colin Maclaurin (1698-1746) and as it 
has been known ever since. What Taylor meant by substituting “evanescent 
increments” is better expressed now as “letting Ax — 0” in the modernized 
Gregory formula, which gives, in today’s notation, 


f"@ 12 ug f(a) 3 


fath)=fa@tf(@h+ T 31 


36 Reproduced by Bateman in “The correspondence of Brook Taylor,” 1906-1907. 


37 Methodus incrementorum directa & inversa (Direct and reverse methods of incremen- 
tation), 1715, p. 21. 
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(25% a) £.Me, Polne gno pptex estos 
code tempore w eefeendo fet nt gO pg obey 
. ° nage 
+ &e, 4 
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Et ipfis =, x, x, x,8c.lifdem manentibus, mutato figno ipfivs v, 
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defen ma ets ~s- oe Fer on We aca notate. 


A? Ld 
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pte relma fa ner 1, quo tem 
Pose 
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Wteretyai obetge onuT sme tt ity 


PCD Sie +e G ROP. 
last 3m 


BROOK TAYLOR IN 1714 AND HIS SERIES 
Portrait by an unknown artist. 
Taylor’s series appears in Corollary II. 


If we put a = 0 in the Taylor series and replace h with x, we obtain 


f") 2 fl") 3 

7 ea 

which is called the Maclaurin series because it was frequently used by Maclau- 
rin in his Treatise on fluxions, published in Edinburgh in 1742. Maclaurin, of 
course, made it clear that the result was “given by Dr. Taylor.” 

Taylor, who credited Newton, did not mention Johann Bernoulli, who 
claimed to have published an equivalent result eleven years before. This 
prompted one of the many disputes in which Johann Bernoulli used to fin 
himself embroiled.*® 
However, the whole story must be told, and to do that we must return to 


Gregory’s series expansions for the tangent and the secant. Gregory did not 


f(x) = FO) + f'O)x + 


38 In “Additamentum effectionis omnium quadraturarum & rectificationu curvarum per 
seriem quandam generalissimam,” 1694, p. 438 = Opera omnia, I, p. 126, Bernoulli had 
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give them in our notation, but, instead, wrote the following (translated from 
the Latin) in his letter to Collins of 15 February 1671 [p. 170; 62]: 


Let radius = r, arc = a, tangent = f, secant = s, it will be 


re r t? 1° 
5r4—s 7r6 


and it will be 


ead a 2a° 17a’ 3233a° 
~ ON 3724 | " 18144073 


15r4- 3315r6 


and 
a 5a‘ 6la°® 277a8 
s=re4 t t t : 
806417 


2r © 24r3 ° 720r5 


The last two series are the same as the ones previously quoted at the beginning 
of Section 4.3. To see this in the case of the tangent, note that a = ré and that 
Gregory still used trigonometric lengths rather than trigonometric functions. 
For example, his tf means r tan 0, and then, replacing t and a with the values 
given here, the second series in the quoted paragraph becomes 


; 69404 28 170" , 32330" 
eS Og ag ae 19 iaap 


’ 


as given in Section 4.3. 
The concept of factorial did not exist in the seventeenth century, but to 
write this series as we might expect it today, note that 


3! 5! 7! 9! 
=— 15=—, 15 = —, 181440 = —, 
3 5 5 g 315 16 and 5 


used successive integrations by parts to show that if is a function of the variable z, then 


Int d ; d + zzdn + zddn zidddn 
egr.ndz n —+n 
—_— 0 : “ 1-2-dz 1-2-3-dz2 1-2-3-4-dz3 


It is easy to obtain the Maclaurin series from Bernoulli’s formula, but not immediately. 
Indeed, if f is an infinitel differentiable function, apply Bernoulli’s formula with n = f"', 
n= f",n= f",n = f™, and so on to infinit , performing the integrations on the left 
in each case. Then multiply the resulting equations by 1, z, z7/2!, z3/3!, and so on, and 
add the results. Canceling all terms containing f’(z), f’(z), f’”(z), ... will yield the 
Maclaurin series for f. The details are left to the reader, as well as passing judgment on 
Bernoulli’s claim. 
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and then Gregory’s series for the tangent becomes 


and a8 2g + ps . 2p7 , 466 55 
A ag ae age agg 


Those who own the appropriate electronic means of computation or have some 
free time on their hands will have no trouble verifying that this is the Maclaurin 
expansion of tan @ (except for a numerical error in the last coefficient which 
we Shall discuss later). 

But in 1670 neither Taylor nor Maclaurin had been born, and it is clear that 
Gregory could not have used their work. So, how did he fin his expansions? 
The answer lies (and lay dormant until 1938) on some manuscript notes that 
he wrote about a fortnight before his letter to Collins giving these series. 
They fil some of the blank space of a letter that he had received from his 
bookseller, Gideon Shaw, dated 29 January 1671 (paper was expensive in 
those times and Gregory was in the habit of writing on the blank spaces of 
received correspondence). Gregory’s notes start as follows [p. 350]: 


(i) arc =a 1s one 3r 
sine = s 
. r 2r* 
radius=r m=q m=z m=—; 
c vc 
secant = v 5 
. r-q vc 
sinecom.=c t=— t= — — 
) 2q 
tang. =q 


They are not particularly intelligible from a current point of view, but they 
can be interpreted, and what follows is an elaboration of Turnbull’s interpre- 
tation. It is clear that the left column just establishes the notation for several 
trigonometric items, “sine com.” being the sine complement ,or cosine, and 
since we are dealing with lengths, c = r cos 6. Note that the tangent (meaning 
rtan6) is now q, and not ¢ as before, and we are interested in the values 
of m given on the third line. These are the values of the ordinate of the curve 
q =r tané in the 1* column, what we now call its firs derivative with respect 
to 6 in the 2" column, and its second derivative in the 3™ column. We are not 
concerned with the values of ¢, which now stands for the subtangent. 

To verify the statements about the derivatives of g, note that (in current 
notation) 

dq r - r 


dd cos?0_ (rcos6)2c2 
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and then, from the firs of these equations and using the quotient rule, 


dq _ r[2 cos 0@(— sin @)] _ 2r sind _ 2r4(r tan 0) 2r4q 


doz cos* @ ~ cos8@  (rsec0)(r3cos? 6) vc3 


Gregory may have thought that computing “higher derivatives” in this way 
was time-consuming. In any event, he modifie his approach and then wrote 


2q° 
2 9 


g 
m=q m=r+— m=2q+ 
r i 
which is another way to write qg and its firs two derivatives. Indeed, this form 
of the firs derivative is obtained as follows: 
r? tan? @ a 


a eee) aes =r¢4 
dQ ~—cos?0 r oe 


and the second can be found using the chain rule 


dq d q’\dq _ 24 q 2q° 
=*(,; =4(r4 2) 2944. 
de2 dq r J} do r r r2 


But did Gregory know the chain rule? Before we venture an opinion, consider 
the firs seven derivatives of g = r tan 0, which he gave later in the manuscript 
under discussion in the following form [p. 352]: 


(xiii) jst gnd 3rd qth 
2 3 2 4 
8 6 
m=q m=r+— m=2qg+— sop & Z 
r r | i 
5th 6th 
40q*_ —_24q? 136q* 240g* ~—-:120q° 
m = 16q + 4 + 4 m = 16r 4 Z “4 o¢ 
id vr r r 
7th 


3 5 7 
m = 272q + 987& + 16804 + 7204 
r if r 


gih 
2 4 6 8 
m = 272r + 32334 4 113614 + 134404 + soaoL 
r r r r 


Now, if Gregory did not know the chain rule he must have needed a lot of 
additional scrap paper before he wrote these derivatives down on the scrap 
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paper that contains them. Be that as it may, the derivatives—for that is what 
they are called now—are correct, except for the coefficien 978 in the sixth, 
which should be 1232. The error carries on to the last line, in which the second 
and third coefficient should be 3968 and 12096 instead of 3233 and 11361. 
It seems to be the consequence of a trivial mistake, which is, for our purposes, 
very significant Computing the sixth derivative from the fift using the chain 
rule requires the addition of the terms 


2(136)q q7 and 4(240)q° 


re r r3 


among others. Thus, the coefficien of g?/r? in the seventh derivative should 
be 272 +960 = 1232, but, in an obvious slip, Gregory added 27 +960 = 987. 

Putting this mistake aside, but only for a moment, the rest of Gregory’s 
manuscript contains the same type of successive differentiations for six other 
functions, one of them being g = r sec @. And six of these seven are functions 
whose power series expansions he gave in the subsequent letter to Collins. 
Thus, we have the following partial answer to the question that we posed 
before: yes, Gregory knew how to compute, and in fact did compute, the 
derivatives necessary for a Maclaurin expansion of those six functions. Still, 
he could have obtained the expansions in some other manner. But besides 
the obvious fact that the repeated differentiations evaluated on Shaw’s letter 
would then have been pointless, there is a clincher. The error that Gregory 
made in computing the last two derivatives of r tan 6 appears again, two weeks 
later, in the second series that he sent to Collins: the numerator 3233 on the 
fift term of the series for t (page 222). The evidence is pretty conclusive, and 
then the conclusion inescapable. Gregory used the successive differentiations 
on Shaw’s letter to obtain six of the series expansions that he sent to Collins. 
In other words, Gregory discovered the Maclaurin series expansion before 
Taylor and Maclaurin were born, 41 years before Taylor discovered it. 

Why did Gregory not publish his results? His intention to publish was 
clearly expressed in his letter to Collins of 23 November 1670 [p. 118; 45]: 


I have almost readie for the presse another edition of my quadratura circuli 
& hyperbole, wherein (if I be not much mistaken) I demonstrat my intent 
many & several ways; I purpose also to add to it several universal methods, 
as I imagine, as yet unheard of in Mathematicks, both in geometrie and 
analyticks: I am afrayed I can have it but naughtilie don here; and therfor I 
humblie desire your concurrence to try how I can have it don in London, and 
advertise me with the firs occasion. 
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Those “unheard of universal methods” must have been the interpolation the- 
orem, the binomial theorem, and the Taylor—Maclaurin series, already in his 
mind. It may be doubted that this quotation refers to the Maclaurin series, 
which was firs given later in the Shaw letter the following January, but Gre- 
gory had actually announced it in November 1670 [p. 120; p. 47]: 


Ihave a methode also whereby I turn ageometrick problems (at least al I have 
yet considered) into an infini series; and amonge others Keplers probleme 
... Tresolve with great ease. 


So he was enthusiastic about his discoveries and ready to publish. What hap- 
pened that deprived him of so much deserved credit? In November, Gregory 
had received only one of Newton’s series from Collins, but then Collins wrote 
on 24 December 1670, expressing himself as follows [p. 154; 54]: 


I have since had some few Series more out of Newtons generall method ... 
the method is universall and performes all Quadratures ... in this method, 
the curved lines of all figure that have a common property, are streightened, 
their tangents and Centers of Gravitie discovered ... the length of the Curved 
line being given an Ordinate is found and the Converse. 


In short, Newton was ready to unveil the universal panacea. As a sample, 
Collins included four of those series that he got out of Newton, including 
those for the sine, cosine, and arcsine [p. 155; 54]. To this, Gregory replied 
on 15 February 1671 [p. 170; 62]: 


As for M' Newton his universal method, I imagine I have some knowledge 
el ie 


And it was at this point that he included the seven series already discussed. 
But it is possible to read an undercurrent of disappointment in this statement. 
Gregory might have assumed that Newton already knew everything that he 
had recently discovered himself and thought it was pointless to publish. The 
fact is that he changed his mind about publishing, expressing his decision as 
follows in the same letter of 15 February 1671 [p 171; 63]: 


39 As we shall see in Chapter 5, Gregory had already published how to fin “the length of 
the Curved line being given an Ordinate” in his Geometrie pars vniversalis of 1668. 
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I thank you werie hertilie for your good advice, as to the publication of my 
notions, & for your civil profer; I would be werie sorrie to put you to so much 
trouble. I have no inclination to publish any thing, safe only to reprint my 
quadrature of the circle, and to add some little trifle to it. 


Was this loss of interest in publishing compounded by Collins’ warnings about 
the dire publishing situation in London? In the same letter of 24 December, 
Collins said [pp. 156; 55-56]: 


There is not any Printer now in London accustomed to Mathematicall 
worke, or indeed fitte with all convenient Characters, and those hand- 
some fractions but M' Godbid where your Exercitationes were printed, 
and at present he is full of this kind of worke to wit ... 


And here he proceeded to give a long, exhaustive list of all those works in the 
queue to be printed. Was this the trouble that Gregory hesitated to put Collins 
through? Whatever the reasons, the three most important pieces of Gregory’s 
lifetime work remained unpublished. 

Newton was not in possession of such methods. Of the new results that 
Gregory had obtained and could have published, Newton had discovered only, 
at that time, the binomial series, as shown in Section 2.5. But Newton was a 
giant, and fi e years later he carried out, and eventually even published, his 
work on interpolation as mentioned in Section 4.3. The next step took him 
a little longer, but he discovered the Taylor series twenty years after Gregory 
but still twenty years before Taylor. This, however, he never published. The 
Maclaurin and Taylor expansions are the most important achievement in New- 
ton’s manuscript De quadratura curvarum.*° At some point Newton stated 
the following result [p. 93]. 


PROPOSITION XII. 


Out of an equation involving two fluent quantities, either alone or together with 
their fluxions,*' to extract one or other quantity in an unterminated converging 
series. 


40 This is the original manuscript of November—-December 1691. Page references are to 
its printing in Whiteside, The mathematical papers of Isaac Newton, VII, 1976, pp. 48-129. 
For the many versions of this work, see the bibliography. 

41 We are getting ahead of the story here. Fluent quantities (that is, quantities that vary 
with time) and their fluxion (or the rates at which they vary) will take center stage in the 
next chapter. 
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Then, after considering three cases, he gave four corollaries, of which 
we reproduce here the third and fourth, and then a translation of the third, 
admittedly out of context. Let us just say, for the benefi of today’s reader, 
that a dot on top of a variable denotes its derivative with respect to time, two 
dots the second derivative, and so on. But it is neither possible nor necessary 
to fully understand Newton’s corollaries at this point [pp. 97-99]. 
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Newton’s discovery of the Taylor series. 
From Whiteside, The mathematical papers of Isaac Newton, VII, Facing page 98. 


Corol. 3. Hence, indeed, if the series proves to be of this form 


y=azt+bze+ez7+dz+ez° + &e 
(where any of the terms a[z], b[z7], c[z*], d[z*], ... can either be lacking or 


be negative), the fluxion of y, when z vanishes, are had by setting z = 4, 
Zz 


= 2b, > =6c, > = 24d, X = 1208. 
vg z 
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That is, 


_ 
222° " 623° * 24z4* * 12025 


+ &c, 


which, if we set z = t, so that z = 1, and insert the omitted term yo = y(0), 
gives the Maclaurin series 


_ : Yo Y 3 y4 a a 
Y= YT IT yz t 32 + ae + 52 ; 


Corollary 4 is similar but gives the full Taylor series, as shown in the photo- 
graphic reproduction of Newton’s manuscript. 

There has been too much talk about derivatives and integrals in this section. 
It is clear that the calculus had already been discovered when these general 
expansions were obtained, but we have not talked about it. We must, therefore, 
interrupt the story on infinit series to present the development of the calculus 
before we embark on the thorny question of convergence. 


THE CALCULUS 


5.1 THE ORIGINS 


We have seen some of the roots of the integral calculus in the method of 
exhaustion to perform quadratures, at least as exemplifie by Archimedes’ 
quadrature of the parabola. These ideas were to be resurrected and perfected 
in the seventeenth century. 

Anancestor of future ideas about what we now call the differential calculus 
can be found in the fourteenth-century study of the /atitude of forms. To put it 
in very simple terms, a form—a concept of Aristotle—was what we now calla 
function, for it was thought of as a quality subject to variability, such as veloc- 
ity, temperature, or density. John Duns Scotus (1266—1308) was one of the firs 
to consider the increase and decrease (intensio and remissio) of the intensity 
of forms. For the intensity of a form must be distinguished from its exten- 
sion: velocity, temperature, and density are intensities, while distance traveled, 
heat, and mass are extensions. But fourteenth-century Scholastic scholars, es- 
pecially at Merton College, Oxford—in particular, William Heytesbury and 
Richard Suiseth (or Swineshead)}—also studied such fin points as the rate of 
change and the rate of change of the rate of change of a form, of course in a 
verbal manner and not always separate from philosophy or mysticism. 

In a manuscript entitled De motu (On motion), attributed to Suiseth, he 
classifie all motions into two kinds: uniform [pp. 243; 245]' 


1 The quotations in this section are from Clagett, The science of mechanics in the Middle 
Ages, 1959. This work contains the Latin texts of the manuscripts by the Merton College 


E.A. Gonzalez-Velasco, Journey through Mathematics: Creative Episodes in Its History, 230 
DOI 10.1007/978-0-387-92154-9_5, © Springer Science+Business Media, LLC 2011 


SECTION 5.1 THE ORIGINS 231 


in which in every equal part of the time an equal distance is described. 
and difform: 


Difform motion [difformis motus] is that in which more space is acquired in 
one part of the time and less in another equal part of the time. 


In turn, difformis motion was classifie into uniformiter difformis and dif- 
formiter difformis, according to whether the instantaneous rate of change of 
the rate of change was constant or not, and further subdivisions were con- 
sidered. For instance, in 1335 Heytesbury gave the following definitio in 
Regule solvendi sophismata (Rules for solving sophisms), Part VI: Local Mo- 
tion [pp. 237; 241]: 


For any motion whatever is uniformly accelerated [uniformiter intenditur] 
if, in each of any equal parts of the time whatsoever, it acquires an equal 
increment [latitudinem] of velocity. 


The scholars at Merton College arrived at a theorem for the mean intensity 
of a uniformiter difformis form. It was stated firs by Heytesbury [pp. 270; 
277], and then by Suiseth in De motu as follows [pp. 244; 245-246]: 


Wherever there is uniform increase [intensio] of local motion, the local mo- 
tion is uniformly difform motion. Since local motion uniformly difform cor- 
responds to its mean degree [of velocity] in regard to effect, so it is evident 
that in the same time so much is traversed by means of [a uniform movement] 
at the mean degree as by means of the uniformly difform movement. 


More precisely, if an object travels a distance d in time f, starting with initial 
velocity vj, moving with constant acceleration, and reaches a fina velocity vs, 
then “in the same time [t] so much [d] is traversed” as if the object moved “at 
the mean” constant velocity 

vj + UF 

2 

In short, the average velocity of a body moving with constant acceleration is 
the mean of its initial and fina velocities. 


scholars and translations of large passages from them. Page references given here are to 
this edition; the page on the left of the semicolon refers to the English translation, and the 
one on the right to the original in Latin. 
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Several proofs were provided, some based on manipulations with infinit 
series. The leading fourteenth-century treatise on the latitude of forms was the 
Liber calculationum (probably composed in the second quarter of the century) 
of Richard Suiseth, who became known as Calculator. It contains the Regule 
de motu locali (Rules of local motion), in which Suiseth gave four proofs, of 
which the third is the most interesting of all the Merton College proofs.” 

When this subject reached the Continent, Nicole Oresme clarifie the pic- 
ture by introducing graphs to distinguish the different types of variation in 
the latitude of a form.? He made this approach an integral part of his work 
De configurationibus qualitatum, which was probably written before 1361. 
His geometric representation is proposed in the following way in Chapter | 
[pp. 347; 368]. 


Hence every intension which can be acquired successively is to be imagined 
by means of a straight line erected perpendicularly on some point or points 
of the [extensible] space or subject of the intensible thing.* 


Inshort, we havea system of coordinates. On the horizontal axis (the longitudo 
axis in geographic terms) we represent the extension of the form and on the 
vertical axis the Jatitudo or intensity of the form. 

Later, opening Chapter 11, Oresme added [pp. 352; 372]:° 


Thus every uniform quality is imagined by a [rectangular] quadrangle [that is, 
a horizontal line at a constant height], and every quality uniformly difform 
terminated at no degree [ending at zero] is imaginable by a right triangle. 
Every quality uniformly difform terminated at both ends at some degree is 
to be imagined as a quadrangle having right angles on the base and the other 
angles unequal. Moreover, every other linear quality is called “difformly 
difform” and is to be imagined by figure disposed in other and considerable 
varying ways. 


? It is reconstructed in modern notation by Clagett, The science of mechanics in the 
Middle Ages, pp. 295-297. 

3 Tf it comes to questions of priority, Giovanni di Casali had used graphs for the same 
purpose before Oresme, but not to the same extent or with the same influence In this 
connection see Clagett, The science of mechanics in the Middle Ages, pp. 332-333. 

4 “Omnis igitur intensio successive acquisibilis ymaginanda est per lineam rectam per- 
pendiculariter erectam super aliquod punctum aut aliquot puncta spacii vel subiecti illius 
rei intensibilis.” 

5A slightly different translation can be found in Clagett, Nicole Oresme and the medieval 
geometry of qualities and motions, 1968; reprinted in Calinger, Classics of Mathematics, 
1995, p. 255. 
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Uniform Uniformly difform 


Difformly difform 


The preceding discussion notwithstanding, the firs work that we can rec- 
ognize as properly belonging to the subject now called the calculus is Kepler’s 
book on the solid geometry of wine casks.® In spite of its title, this is a book on 
mathematics, largely devoted to evaluating the volumes of solids of revolution. 
For example, a chord in a circle that is not a diameter divides the circle into 
two parts of unequal areas. If the largest is rotated about the chord, it produces 
a solid “which is of the form of the fruit of the apple tree”; if the smallest is 
rotated it produces “what can be called the figur of a lemon” [Supplement 
to Part I, p. 576]. All in all, Kepler considered the volumes of 96 solids of 
revolution.’ Later he turned to the problem of findin the best dimensions for 
wine casks and was able to determine that the largest right parallelepiped with 
a square base that can be inscribed in a sphere is the cube [Part II, Theorem IV, 
pp. 607-609], and that of all right circular cylinders with equal diagonals the 
largest is the one that has a ratio of diameter to altitude of J/2 to 1 [Part II, 
Theorem V and Corollary II, pp. 610—612]. 

After Kepler, many seventeenth-century mathematicians worked on prob- 
lems of findin tangents and quadratures. The list includes, but is not limited 


8 Nova stereometria doliorum vinariorum, 1615 = Opera omnia, IV, 1863. Page refer- 
ences are to the Latin text in the Opera. 

7 The volume of an apple is obtained in the Corollary to Theorem XIX of Part II, which 
can be seen in English translation in Struik, A source book in mathematics, 1200-1800, 
1969, pp. 192-197; reprinted in Calinger, Classics of Mathematics, pp. 356—360. 
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to, Galileo Galilei, Bonaventura Cavalieri, and Evangelista Torricelli in Italy; 
Blaise Pascal, René Descartes, and Gilles Personne de Roberval in France; 
Frans van Schooten, René Francois de Sluse, Johann Hudde, and Christiaan 
Huygens in the Low Countries; and John Wallis in England. No history of 
the calculus would be complete without an account of their contributions, 
but it would be impossible to do that here without turning this chapter into a 
book,® so we shall restrict our attention to the work of fi e men in the 1600s, 
each of which has been called the true inventor of the calculus by someone 
at some time: Pierre de Fermat, James Gregory, Isaac Barrow, Isaac Newton, 
and Gottfried Wilhelm Leibniz.? 


5.2 FERMAT’S METHOD OF MAXIMA AND MINIMA 


Pierre de Fermat (1601-1665), a lawyer working in the parlement of the city 
of Toulouse, found some spare time to work on mathematics. The fact that 
he was an amateur mathematician notwithstanding, he is considered to be the 
best French mathematician of the seventeenth century. 

About 1629 Fermat produced an algorithm to fin maxima and minima, 
which is contained in a copy of his original manuscript sent in 1637 by Rober- 
val to Marin Mersenne under the double title “A method to investigate maxima 
and minima” and “About the tangents to curved lines.” '° It was widely circu- 
lated in mathematical circles in Paris—Mersenne himself sent a copy to René 
Descartes—but it was not well received. It simply gives an algorithm to fin 
maxima and minima but includes no supporting explanations whatsoever. 

If Fermat’s method was found unclear by his contemporaries, used as they 


8 There are, in fact, some comprehensive works on the history of the calculus. For 
instance, Baron, The origins of the infinitesimal calculus, 1969 (Dover Publications, 1987); 
Boyer, The concepts of the calculus. A critical and historical discussion of the derivative 
and the integral, 2nd. ed., 1949 (reprinted by Dover Publications as The history of the 
calculus and its conceptual development, 1959); and Edwards, The historical development 
of the calculus, 1979. 

°1’ll exhibit my own prejudice on this matter at the outset. The calculus was not 
invented—not in the sense that many gadgets have been invented—but whatever word one 
wants to apply to its development, it came about as the cumulative result of work by many 
people—much as in the case of geometry and algebra—some more insightful, some more 
extensive. Some represented a breakthrough but some did not. Moreover, the calculus as 
taught today is a product of the nineteenth century, owing much to the rigorization efforts 
of da Cunha, Cauchy, Riemann, Weierstrass, and others. 

10“Methodus ad disquirendam maximam et minimam” and “De tangentibus linearum 
curvarum.” 
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PIERRE DE FERMAT 


From D. E. Smith, Portraits of Eminent Mathematicians, 
Portfolio H, Scripta Mathematica, New York, 1938. 


were to the terminology of the times, it is to be expected that today’s readers 
would fare no better. Thus, we shall adopt the following course of action: 
firs we include a translation of the firs part of this manuscript—a printed 
copy of which can be seen in the next photograph—up to and including the 
example; then we attempt to clarify Fermat’s steps and interpret them in today’s 
terminology; and finall , we shall include a summary of his own explanations 
as to how he got the idea for the method. 

The translation (from the printed copy reproduced on the next page) is as 
follows, but we have inserted roman numerals, which are not in the original, 
to number the steps in the algorithm so that we can refer to these steps later. 


The whole science of the discovery of maxima & minima, rests on the po- 
sition of two unknown [expressions], & this unique rule; (7) let A be any 
unknown of the problem, whether plane, or a solid, or a length, according 
to the proposition to be satisfied & (i) express the maximum or minimum 
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METHODUS 


Ad difquirendam maximam & minimam. 


% MNIS deinyentione maximz & minima dottrina , duabus positions 
bus ignotisihniticut, & hacunica preceptiones Rattatur quilibet guetta 
A iis terminus effe A, five planum, five folidum , autlongitide, prout pro 

pofito (atisfieri parelt , & inventa maxyna aut minima in tenminis tub A, 
gradu ut Jibet inooluriss Ponartr rurfus idem qui prius ofl terminus A, 
+ E, iteramque inyeniatur maxima que minima interminis lub A&E , eradibis uc 
Tibet coefficientibus. Adzquentur, utloquitur Diophanms. duo homogenta mavime 
aut minime equalia & dem ptis communibus ( quo peratto homogenca omnia cx pare 
aleerurra (ab Evel ipfius gradibus afticiunrur ) applicenrar omnia ad E, vlad clita 
rem ipfiusgradum , doneg aliquod ex homogencis, ex parte utvayis affections fabE, 
omnino fiberctur. 

Elidantur deinde uceimque homogenca fub E, aut iplius gradibus quomodoliber m- 
Volura & religua equencur. Aurficx und parte nihil firpereft rquentur fine , quod co- 
dem recidir, nogaraad firmatis, Refolutio ultime iftins equatitatis dabir valorem A, 
qua cognita, maxim aut minima ex tepetitis priors refolutionis veltigiis innotefccr. 

Exemplum (abijcimus 


Sir recta AG, ita dividends in E, ur retang. AE C, fir maximum; Reha AC, di- 
carur B, 


A E c 


u a 
ponatar pat ahora B, cle A, ergo ichiqua crit B, — A, & reétang. fab fegmentis ent 
B, in A, —A®* quod deber inmenizi maamum, Ponatur rurfis pars altera iptius By 
cle A, + B, ergd reliqua etitB, —,A— E, & reétang, Sub, {egmentis eric Ban A 

,A? =~ B.in E,’E in A, —E, quod debet adaquati luperior! reGang, B,in A, 

A‘, dempuy commanibus B, iy E,adrguabitur A, inE’ = E s écommbus per 
E, divifis B, adzquabimr “A -» E, clidaturE, B, equabirur “A, igieur B y bitusam 
ct diyideudt, ad folitioncin propoli, ave potelt gencralior dart merhodus 


AS Beafi Pe 98 Reco Beak Be of feeh Be MG Jesh Seah Qe ok Seh Gee dere SPE 
De Tangentibus linearum curvarum. 


D fayeriorem imethodtim inventionem Tangentiumt ad data pandts in hace 
quibsleummaue curvis reduciumus * 
4 


First page of the printed copy of Fermat’s Methodus 
in his Varia opera mathematica, 1679. 


in terms of A, using terms that can be of any degree whatever; (iii) Posit in 
turn that the same term as before is A + E, and express again the maximum 
or minimum in terms of A & E, using terms of any degree. (iv) Adequate, 
as Diophantos would say.'' the two homogeneous [expressions] giving the 
maximum or minimum & (v) remove the common terms (when done all terms 
in the homogeneous [expressions] on either side will contain EF, or powers 
of it) (vi) divide all terms by EF, or by a higher power of it, until any of the 
homogeneous [expressions], on one side or the other, is completely free of E. 


11 Fermat was borrowing here a word and a meaning from a translation of Diophantos’ 
Arithmetica by Holtzman, 1575. He owned a copy of its republication with added commen- 
tary by Claude-Gaspard Bachet de Méziriac, 1621. 
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(vii) Remove then from both sides all the terms involving E or a power 
of it in any manner, and (viii) adequate the rest. Or, if nothing remains on 
one of the sides reasonably adequate, which amounts to the same thing, the 
negative terms to the positive. The solution of this last equation will give 
the value of A, which once known, the maximum or minimum will become 
known by repeating the steps of the preceding solution. '2 


Now, with the aid of the inserted roman numerals, we can simplify and 
rewrite Fermat’s algorithm in our own terms as follows: 


(i) Let A be the variable in the problem, for which we shall write x. 


(ii) Express the quantity to be maximized or minimized as a function of the 
variable. We shall denote it by f(x). 

(iii) Increment the variable as A + E, for which we shall write x + h, and 
evaluate f(x + h). 

(iv) “Adequate” (in the sense of setting approximately equal) the two pre- 
ceding expressions: 


f(x +h) © f(x). 


(v) Simplify by removing terms that appear in both f(x + A) and f(x). 
(vi) Cancel any power of h higher than the firs that is a common factor of 
both f(x + h) and f(x) and then divide by h, which is equivalent to 
setting 
f(x +h) — f(x) a0 
ine aie . 


(vii) Disregard or strike out any terms that still contain / or a power of h. 


(viii ) “Adequate” the rest (to 0 since we had moved f(x) to the left-hand 
side). 


With this interpretation, part of Fermat’s algorithm is what we do today, at 
least to the evaluation of what we call the difference quotient, but then we take 
the limit as h — 0 and actually equate it to 0. However, the concept of limit 
was not available to Fermat and he did not create it. Thus, he had to make do 
with disregarding higher powers of # and working with an approximation. Of 


12 The Latin version of this quotation, as well as Fermat’s example given below, can 
also be seen in Euvres de Fermat, 1, 1891, pp. 133-136. There are two somewhat different 
English translations of the preceding paragraphs that the reader can consult, and the detailed 
references are given in the bibliography. The one by Struik includes the translation of 
Fermat’s complete manuscript. 
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course this is valid only if / is very small, a claim that he did not make at any 
stage of this procedure, but seems to have had in mind. 

An example would be welcome at this point, so we consider the problem of 
splitting a straight line segment AC of length B into two portions by a point E, 


A E Cc 


and the problem is to fin £ such that the product AEx EC is a maximum. 
Here are the problem and its solution as stated by Fermat. 


We exhibit an example. 

Divide the straight line [segment] AC at EF, so that the rectang. AEC 
[meaning the area AE x EC] is a maximum; Denote the segment AC by B. 
Let A be one part [segment] of B, so that the rest will be B — A,'? & the 
rectang. [product] of the segments will be BA — A” which must become a 
maximum. Letinturn A+ E be one part of B, so that the rest will be B—A—E, 
& the rectang. [product] of the segments will be BA— A*+ BE—2EA- E”, 
which must be adequated to the rectang. given above B.A — A’, remove the 
common terms [to obtain] BE, adequal to 2AE + E’, & divide all by E [to 
obtain] B, adequal to 2A + E, remove E, therefore adequate 2A to B, [and] 
divide by 2, to [obtain] the proposed solution, it is not possible to give a more 
general method. 


And here it is restated in our own terms. 
(i) Let x = AE. 
(ii) f(x) = x(B— x) = Bx — x’. 
(iii) fe th) = (e+ h)(B—x — A) = Bx — x* + Bh—2hx — Pr’. 
(iv) “Adequate”: Bx — x? + Bh —2hx —h? ® Bx — x’. 
(v) Simplify: Bh © 2xh + h?. 
(vi) No power of / other than the firs is a common factor of all the terms, 
so just divide by h: B © 2x +h. 


(vii) Disregard the term that still contains h: B © 2x. 


(viii) The rest is already “‘adequated,” and the solution is x = B/2. 


13 Here Fermat is being naughty in denoting by A one of the segments in which AC is 
divided by E. Also, notice from the photograph that we have changed the notation B, —A 
to B — A and we shall also write BA for the product of B and A rather than B, in A. 
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Today we would obtain the same answer by setting f(x) = Bx — x? and 
then 


». Fa+h) = F@) . Bh—2hx —h? 
lim = lim 
h-+0 h h->0 h 


= B-—2x=0, 


and it is clear to us that Fermat had the seed of the present method. 

To his contemporaries, in particular to Descartes, he was at best obscure 
and at worst accused of having found the algorithm by trial and error (a 
tatons and par hasard ). Fermat was stung, and reacted by producing several 
explanations of his algorithm. The most important for our purposes is in 
a manuscript that begins with the words “Dum syncriseos et anastrophes 
Vietze methodum expenderem,”"* probably written about 1639 or 1640 '® to 
directly refute Descartes’ allegations. This paper, which we shall simply call 
Syncriseos, begins with an acknowledgment of Fermat’s source of inspiration: 


When considering Viéte’s method of syncrisis and anastrophe, ... a new 
method derived from it came to my mind for findin maxima and minima 


Viéte was fond of creating Greek neologisms, and “syncrisis” was one of them. 
The key element in the method of syncrisis is the assumption that a polynomial 
of order n has up to n roots and the relation between those roots and the order 
of the polynomial. '® 

In the second paragraph of this paper Fermat revealed his second source 
of inspiration to be a result of Pappos of Alexandria, which he later identifie 
more precisely [p. 151] as Proposition 61 from Book VII of the Synagoge or 
Mathematical collection.” The opening statement of this lemma is this: 


Given three straight lines [segments] AB, BC, C Dif one makes the rectangle 
ABD to the rectangle AC D, as the square on BE to the square on EC, the 


14 1¢ appears in Fermat, Guvres, I, p. 147, with the title “Methodus de maxima et minima,” 
chosen by the editors. Page references are to this work. 

15 For the reasons for this dating see Mahoney, The mathematical career of Pierre de 
Fermat 1601-1665, 1994, p. 145. 

16 Syncrisis is explained in Chapter XVI of De recognitione equationum = Francisci 
Vietee opera mathematica, in unum volumen congesta, ac recognita, 1646, pp. 104 ff; English 
translation in Witmer, The analytic art by Francois Viete, 1983, pp. 207ff. 

17 This is a work on classic Alexandrian geometry, with some new proofs and additions, 
written in the late 200s. This proposition can be found in Commandino’s translation: 
Mathematice collectiones, 1588, f. 196". 
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ratio is unique, & smallest of rectangle AE D to rectangle BEC ae 


This statement is to be interpreted with reference to the following figure which 


+ + 4 
A B E Cc D 


is a small part of Pappos’ figure Although Commandino, the translator, con- 
fessed to not understanding the end of this statement—which he had just ren- 
dered as “singularis proportio, & minima est rectanguli AE D ad rectangulum 
BEC”—Fermat did understand. Unfortunately, he changed Commandino’s 
notation in his explanation. Using the original notation to avoid unnecessary 
confusion, the problem that the proposition poses is [p. 151]: 


Let ABCD be a straight line, in which the points A, B, C, D are given. 
Choose a point FE between the points B and C, such that the rectangle AE D 
to the rectangle BEC has minimum ratio. 


The solution given by Pappos in Proposition 61 is to choose E such that 


ABXBD BE? 
ACXCD EC2 


Fermat had solved this problem in a manuscript written before Syncriseos, 
and this led him to a quadratic equation.'® Such an equation normally has two 


18 Recall that “the rectangle ABD” means the area ABx BD, and similarly for the other 
rectangles. 

19 In publication, the title of the manuscript is “Ad eamdem methodum.” It is possible 
to obtain both the quadratic equation and Pappos’ solution rather quickly using today’s 
differentiation. Write (as Fermat wrote in Syncriseos, except that we must use lowercase 
letters, while he used uppercase ones) BC = b, BD = z, AB = d, and BE = a. 
As is frequently the case in Fermat’s writing, a is the variable. Then, according to his 
interpretation, we have to minimize 


AExED _ (d+a)(Z—a) | dz—da+za—a? 
BExEC  a(b—a) ba — a? 


Then equating the derivative of the right-hand side with respect to a to zero and simplifying 
gives 


(b+d—z)a? — 2dza + bdz =0 
[“Ad eamdem methodum,” in Varia opera, p. 67 = Guvres, I, p. 144; Syncriseos, p. 152]. 
This is the quadratic equation, which Fermat obtained by syncrisis (to be explained shortly) 
rather than by differentiation. Multiplying it by b, and then adding and subtracting dza? to 
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solutions. But then he focused attention on the words that Commandino did 
not understand and realized that at the minimum there is only one solution. 
What happened to the other solution? 

To fin out, Fermat started by considering a simpler problem, and found 
it in Euclid’s Proposition 27 from Book VI of The Elements. It is the one he 
presented in the Methodus: split a segment of length B into two parts A and E 
so that the product of A and E is largest. He knew the solution because it was 
given by Euclid as “that parallelogram is greatest which is applied to half of 
the straight line” 2° and its area is B7/4. 

But in Syncriseos Fermat was in search of the two solutions of a quadratic, 
and modifie the problem by asking how the original segment should be split 
so that the product of the resulting segments is a constant Z < B7/4. In his 
own words [p. 148]: 


But, if it is proposed that the same straight line B is to be cut under the 
condition that the rectangle [product] of the segments is equal to an area Z 
(which is supposed to be smaller than one fourth of B square), then two points 
satisfy the proposition, and the point of maximum rectangle is between them. 


He meant that there are two solutions of the quadratic that we would write 
now as A(B — A) = Z (but he wrote in Viéte’s style as “B in A — A quad. 
zquale Z plano”), and denoted these solutions by A and E. Here is Fermat’s 
view of what happens to these solutions as Z approaches its maximum value 
B?/4 [pp. 148-149]: 


If, in place of the area Z, we take another that is larger than the area Z, but 
smaller than one-fourth of B square, then the straight lines [segments] A 
and E will differ from each other by less than the ones above, as the points 
of division more nearly approach the point constitutive of the maximum 
rectangle,2' always, as the rectangles of the divisions increase, the difference 


the left-hand side, we obtain 
(b> + db — bz — dz)a* — 2dza +b’ dz + dza* = 0, 
which, after factoring and rearranging, can be written as 


dz a 


(d+b\(z—b) (b—ay 
This is Pappos’ solution if the lowercase letters are replaced by the segments they represent. 
20 Quoted from the Dover Publications edition of The Elements, vol. 2, 1956, p29: 


21 This is Fermat’s intuition, for he offered no proof of this or of the fact the “the point of 
maximum rectangle lies between” A and B. 
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of these A and E will decrease, until with the last division of the maximum 
rectangle it vanishes, in which case a wovayoc [Pappos’ word] or unique 
solution will be produced, since the two quantities [become] equal, that is, 
A will equal E. 


Now, if 
A(B—A)=Z and E(B—E)=Z, 


then, by the method of Viéte (“syncrisis”), 


A(B — A) = E(B — E) 
or, rearranging, 
B(A — E) = A* — E’. 


Dividing by A — E, we obtain B = A+ E (which in this very simple example 
is geometrically obvious). Fermat drew the following conclusion [p. 149]: 


Since, therefore, in the two correlate equations above, by the method of Viéte, 
B will equal A + E, if E is equal to this A (which appears to hold always at 
the point constitutive of a maximum or a minimum), then, in the proposed 
case, B will equal 2A: that is, if the straight line B is cut in half, the rectangle 
[product] of its segments will be a maximum. 


This is the essence of Fermat’s method to fin maxima and minima, which he 
then applied to another example [p. 149]: 


To cut the straight line B, in such a manner that the square of one of its 
segments [call it A again] times the other is a maximum. 


In this case we have to maximize A?(B — A). Then consider the equation 
BA? — A? = Z, where Z is a constant smaller than the maximum value. If E 
is another solution of this equation, then, by syncrisis, 


BA? — A? = BE? — E?, 
or 
BA = Py) aS: 


and dividing by A — E, 


B(A+ E) = A°+AE+E’. 
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At the maximum, A = E, and then 


2ph=34", 
so that A = 2B/3. 
But Fermat noted at this point that division by binomials was exceedingly 


laborious and generally troublesome (“‘operosa nimis et plerumque intricata’’) 
[p. 149], and put forth the following idea [p. 150]: 


However, since E (as well as A) is an uncertain quantity, nothing prevents 
us from calling it A + E. 


Then the difference of the two solutions is just E, a much easier quantity to 
divide by. While Fermat showed his new approach by a new example—and 
went on to crown the Syncriseos with the solution of Pappos VII, 61—we shall 
not do that. Instead, to make a long story a little shorter, we shall apply it to 
his firs example. Then, by syncrisis, 


A(B — A) = (A+ E)[B—(A+)], 
or, simplifying and rearranging, 
BE =2AE +E’, 


just as in the example included in the Methodus. Dividing by E and recalling 
that at the maximum “EF will give nothing” [Syncriseos, p. 150], we obtain 
B = 2A. The solution is, as before, A = B/2. However, since he was hiding 
any explanation of his methods in the Methodus, he needed an explanatory 
gimmick, and he found it in Diophantos’ adequality. 

The Methodus also included a procedure to fin tangents, and posing the 
problem is quite simple:22 


Let there be given, for example, a Parabola BDN, whose vertex is D, the 
diameter DC, & a point B given on it, to extend a straight line BE, tangent 
to the parabola, & meeting the diameter at a point E. 


To solve the problem, Fermat took a point O on BE located over the segment 
CD and drew OI parallel to the ordinate BC. In Fermat’s time there was no 
equation of a parabola as we know it today, but if we think of a rectangular 


22 Varia opera, p. 64; uvres, I, p. 135. English translation in Stedall, Mathematics 
emerging, 2008, pp. 73-74. 
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coordinate system with the origin at D, the positive x-axis in the direction 
DC, and the positive y-axis horizontally to the left, it is clear from what he 
wrote next that he was dealing with the parabola x = y”. Then, if the ordinate 
at x = DI is denoted by y;, we see that y; < OJ, and then 


But from similar triangles, 


BC? CE 

OI2 TE? 
and therefore 

CD CE? 

er: > ——aee 

DI” IE2 


At this point Fermat managed to be naughty again by using some of the 
old letters with new meanings as follows: 


But since the point B is given, the applied line BC is given, hence the point C; 
is given as well as C D; therefore let CD be equal to a given D. Let CE be 
A; let CI be E. 


Why not? After all, the alphabet is limited. In the new notation the last 
inequality becomes (translating once more from Fermat’s narrative form to 
present-day usage) 
D A? 
> ’ 
D—E° A’+E* —2AE 


SECTION 5.2 FERMAT’S METHOD OF MAXIMA AND MINIMA 245 


or 
DLE DE IDA = DA SAE: 


And now we shall let Fermat have the last word to avoid the blame for any 
unclarity in his statement. 


Adequate therefore according to the method above: with common terms re- 
moved, DE” —2DAE ~ —A’E [© meaning “will be adequal to”], or what 
is the same, DE* + A?E ~ 2DAE. Let everything be divided by E, hence 
DE + A* © 2DA, remove DE, hence A? = 2DA, and thus, A = 2D, 
therefore we have proved that C E is twice CD, which indeed holds true. 


In this manner, almost in one breath, an inequality turned firs into an adequal- 
ity and then into an equality. Fermat never gave any subsequent explanations. 

But today’s readers deserve some interpretation in current terminology, 
and to provide it we turn to the equation that Fermat obtained from similarity 
of triangles, which can be rewritten as follows: 


OI IE 
BC GE 
Referring now to the rectangular coordinate system with origin at D that was 


mentioned above, defin x = DC, h = x — DI and write the arc BD of the 
parabola as y = ./x. Then BC = ./x, IE = CE —h, and DI = x —h. If we 
use the adequality OJ ~ \/x —h, the equation of similarity can be replaced 


with 
Vx—h  CE-—h 
af CE 
Subtracting 1 from each side and simplifying, 
Vx —h—Jx 2 h 
ef CE 
or 
PRR ATE 2. lh 
—h CE 


Now multiply numerator and denominator of the fraction on the left-hand side 
by /x —/ + ./x and then simplify to obtain 


1 2, we 
Vx—-h+Jx CE 
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At this point, we would be happy to let h — 0, but the last step in Fermat’s 
procedure was to discard h (his £) and to turn the adequality into an equality: 


1 _ vx 
2/x CE 
and it follows that the subtangent is CE = 2x, the same result that in his 
notation was A = 2D. 

Descartes had quite a number of objections to Fermat’s method, and it is 
quite clear that some of them are as justifie as being impossible to address 
with the mathematical knowledge available at that time. Nevertheless, in 
response to these criticisms Fermat elaborated and improved on the method. 
But Descartes also thought that this method was applicable only to functions 
given in what we call explicit form and challenged Fermat to fin the tangent 
to the curve that we now write as x* + y? = Pxy,?% then known as la galande 
and now as the folium of Descartes. Rising to this challenge, Fermat promptly 
found the tangent that Descartes demanded, sending it to him (via Mersenne) 
ina manuscript entitled “Méthode de maximis et minimis expliquée et envoyée 
par M. Fermat a M. Descartes.” 4 He considered a curve through the origin 


D 


like the one shown in the figure in which the origin is at C, DO is the x-axis, 
and DA is tangent to the curve at A. Fermat started [p. 154] by renaming 


23 Descartes, who gave no graph for this curve, expressed its equation as follows: “les deux 
cubes des deux lignes BC [our y] et CD [our x] soient ensemble égaux au parallélépipéede 
des deux lignes BC, CD et de la ligne donnée P,” Fermat, Euvres, II, 1894, p. 130. 

24 Fermat, Euvres, II, pp. 154-176. 
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some segments as follows (again reusing some letters as before): BA = B, 
BC = D, BD = A, and, for any point E on the tangent, BF = E. Then the 
coordinates of the point E on the tangent are [p. 155] CF = D—E, and, from 
the similarity of the triangles DFE and DBA, 
B(A—E) BA-—BE 
FE= = , 
A A 
Next Fermat considered FJ adequal to FE if E is close to A, and proceeded 
as if E were on the curve. Thus, replacing Descartes’ P with N, the property 
of la galande is CF? + FE? = N- CF - FE. With the expressions obtained 
above for C F and FE in terms of single letters, the sum of the cubes is [p. 156] 


B3 A? — BBE? —3B°A72E +3B°AEZ 


Pak =37 F+3pe 4 3 


while N - CF - FE is [p. 157] 


BA—BE NDBA— NDBE — NBAE + NBE* 


N(D — E)—— a 


Multiplying both expressions by A*, we must compare by adequality 


PALE a3P EA LID A LL PA RHR Hs AE LIOR AL 


with 

NDBA* — NDBEA? — NBEA? + NBE? A’. 
Since the equation for la galande is D* + B’ = NDB, the terms D? A> + B? A? 
and N DBA? can be dropped after adequation. Fermat completed this example 
as follows: 


Divide the rest by E and then drop all the terms that still contain £; it will 
remain [after division by —A?] 


3D°?A+3B°>=NDB+NBA 


and we will have 
NDB-3B _ 
3D2—-NB 
which is what we had to find 


’ 


That is, Fermat had found the subtangent. If we recall that the origin is at C 
and if we write D = CB = x and B = BA = y, then A = y/y’, and the 
reader should easily verify the correctness of Fermat’s result using today’s 
implicit differentiation. 
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5.3. FERMAT’S TREATISE ON QUADRATURES 


Fermat developed an interest in the problem of the quadrature of the higher 
parabolas and hyperbolas, and eventually found a solution, but he did not 
publish anything on this subject during his lifetime, which makes it difficul to 
date his work. From indirect evidence, it appears that he may have obtained 
some results by the 1640s and even by the 1630s.2° Much later, in 1658 or 
1659, he was motivated to write a treatise on quadratures by the appearance 
in 1655 of the Arithmetica infinitorum by John Wallis (1616-1703). Fermat 
claimed to have obtained the same results long ago and showed some criticisms 
of Wallis’ methods.”° 

In his treatise “On the transformation and amendment of local equations 
for the manifold comparisons of curvilinear figure among themselves or to 
rectilinear figures to which is annexed the use of geometric progressions for 
the quadrature of an infinit number of parabolas and hyperbolas,” 2” Fermat 
started with the following assertion [p. 44; 255]: 


Archimedes used geometric progressions only for the quadrature of the para- 
bola... I have recognized and proved this sort of progression to be very 
useful for quadratures, and I willingly communicate to modern geometers 
my invention, which performs the quadratures of parabolas and hyperbolas 
by an altogether similar method. 


He worked out the quadrature of the hyperbola x2y = 1 firs (he chose the 
right-hand side to be an arbitrary constant, but there is no loss of generality 
in taking it to be 1), then that of the parabola y* = x, and finall generalized 


25 For a discussion of these dates, see Boyer, The history of the calculus and its concep- 
tual development, 1959, p. 127; Bortolotti, “La scoperta e le successive generalizzazioni 
di un teorema fondamentale di calcolo integrale,” 1924, p. 215; Walker, A study of the 
Traité des Indivisibles of Gilles Persone de Roberval, 1932, pp. 142—164; or Mahoney, The 
mathematical career of Pierre de Fermat, 1601-1665, p. 244. 

26 Wallis found the area under an arc of y = x?/4, where p/q is any rational number 
other than —1, using amethod that he called investigatio per modum inductionis [Proposition 
XIX], which essentially means guessing what we would call some limits. His descriptions 
are purely verbal, without any variable or function notation [Propositions LV to LVI and 
CII]. The stated propositions can be seen in his Opera mathematica, 1, 1695, pp. 373, 
390-391 and 408. 

27 De cequationum localium transmutatione, & emendatione, ad multimodam curvilineo- 
rum inter se, vel cum rectilineis comparationem. Cvi annectitvr proportionis geometrice in 
quadrandis infinitis parabolis & hyperbolis usus. Page references are to Varia opera and, 
after a semicolon, to Guvres, I. See the bibliography for available English translations. 
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his method to all curves of the form x” y” = 1, m and n nonzero integers, 
with the exception of xy = 1. To give an idea of the method it will suffic to 
present some simple examples. Fermat considered firs the hyperbola DSEF 


C F 


A GH O M R 


shown in the figur (the dashed lines were not printed in Varia opera but added 
in Guvres to help visualize some rectangles), and since he did not have our 
equation for it, began the procedure as follows [pp. 44; 256—257]: 


Consider, if you please, the hyperbola define by the property 
AH? EG AO? HI 

= — an —iorepeesrers™ | 
AG? HI AH? NO 

[that is, x? = 1/y]. I say that the infinit space whose base is EG, and one of 


its sides is the curve E'S, and the other the infinit asymptote GOR, is equal 
to a given rectilinear area. 


etc. 


We shall show this and fin the “given rectilinear area” in two ways. First, 
using present-day notation, to give a quick idea of what is going on, and then in 
Fermat’s terminology, so that we know what he actually did. In the preceding 
figure let the origin be at A and defin a = AG. Now choose a real number 
r > | and then points G, H, O, M,... with abscissas 


a, ar, ar’, ar, 
Then the segments GH, HO, OM, MR,... have lengths 
a(r—1), ar(r—1), ar’(r — 1), are(r — We aay 
and the heights EG, HI, ON, PM,... have lengths 
1 1 1 1 
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so that the rectangles with diagonals EH, 10, NM, PR,... have areas 


(-), —4-), SGO-D, Se-D, 


These form a geometric progression with ratio 1/r, whose sum, according to 
Viete’s formula, is i 
Pies 
= 
aes a 

* 

It approaches 1/a as r — 1, which is, then, the area of the region DNEGR. 
Fermat, to whom all this notation was unavailable, chose the segments 

AG, AH, AO, AM, ... so that they formed a geometric progression, which 
he expressed as [p. 45; 257] 


AG AH AO 4g 


AH AO AM 


and this is equivalent to 


AG GH HO _ 


AH HO OM 


He did not give any reasons, but those who prefer an explanation should note 


that 
AG AH AG+GH 


AH AO AH+HO 
which, after dismissal of the middle fraction, cross multiplication of the re- 
maining fractions and simplification leads to 


AG GH 
AH HO 
Similarly for the other fractions. 
Then Fermat used this to examine the ratio of rectangular areas. For 
example [p. 45; 257], 
EGXxGH = GE GH = GE AG 
HIxHO HI HO HI AH 


28 Tn quoting or describing Fermat’s work I use modern fractions and dots as a convenience. 
Fermat did not use either, expressing himself in verbal form. For example, he gave this line 
as follows: “as AG, isto AH, sois AH, [to] AO, &sois AOtoAM ...” 
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On the other hand, by the definin property of the hyperbola (see the preceding 
quotation) and the fact that the proportionality of abscissas gives AH? = 
AG - AO, we have [p. 45; 258] 


GE _ HA* AO 4 
HI GA? GA 


Therefore, 
EGxGH _ AO AG _ AO _ AH 


HIxHO GA AH AH AG. 


Then he arrived at the following conclusion [pp. 45—46; 258]: 


Similarly we can prove that 


HIxHO _ AO 
ONxOM HA 


But the segments AO, HA, GA that make up the parallelogram ratios 
are proportional by construction [that is, GA/HA = HA/AO, so that 
AO/HA = HA/AG}; hence the [areas of the] parallelograms GExGH, 
HIxHO, ONxOM, etc., assumed to be infinitel many, will form a con- 
tinued geometric progression, the ratio of which will be HA/AG. 


Fermat had already remarked that the terms of the original progression (the 
abscissas) can be chosen close enough to each other that one can “‘adequate the 
rectilinear parallelogram GExGH and the mixed [curvilinear] quadrilateral 
GHE” [p. 45; 257]. Thus the sum of the areas of all the rectangles can be 
adequated to the area under the hyperbola to the right of EG. To sum the 
progression of rectangular areas Fermat could have used Viéte’s formula, with 
which he must have been familiar. However, he chose to state this main tool 
in the following manner, which is equivalent to Viete’s, at the start of his paper 
[pp. 44; 255-256]: 


Given any geometric progression, whose terms decrease indefinitel , the 
difference of two consecutive terms of this progression is to the smaller of 
the two as the largest of all the terms of the progression is to the sum of all 
the others to infinit . 


29 Fermat frequently changed the order of the letters in a segment as he wrote. 
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If S denotes the sum of the entire progression of rectangular areas, the appli- 
cation of this rule in this particular case gives [p. 46; 258] 


EGxGH —IHxHO = EGXxXGH 
IHxHO ~ §—EGxKGH 


If we recall that 
GA 
IHxHO = — EGxGH, 
HA 


then the left-hand side in the previous equation simplifie to and then equals 


HA a GH _ GExGH 
GA GA: GESGA 


On the other hand, if the progression ratio is close enough to 1, the denominator 
S — EGxXGH is adequal to the area of the figur. DN/JHR under the hyperbola, 
which is in turn adequal to the area DNEG R because the segment GH is very 
small. Thus, Fermat obtained the adequation 


GExGH ee EGxGH 
GEXGA ~ area DNEGR 


Since the numerators are equal, adequating the denominators, Fermat ex- 
pressed this result as follows [p. 46; 259]: 


... the parallelogram AE, in this kind of hyperbola, is equal to the area of 
the figur contained by the base E'G, the asymptote GR and the curve ED 
infinitel extended. 


Since the parallelogram AE has area 


AGXGE =a = =. 
a a 
this is the result already obtained above in current notation. 

Now we can understand Fermat’s emphasis on the geometric progression 
in the opening statements of this treatise on quadratures, for it is precisely the 
fact that the areas of his rectangles form such a progression that allowed him 
to fin the sum. He found a way to subdivide the axis so that this happens, but 
had he tried a subdivision by segments of equal length he would have been 
unable to succeed. 
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Then Fermat stated [p. 46; 259]: “It is not laborious to extend this discovery 
to all the hyperbolas of this sort, except the one that we mentioned [xy = 1].” 
He gave a brief proof that in the next case, x*y = 1 [pp. 46; 259-260], 


AGXGE =2DNEGR, 
or, if a = AG and GE = 1/a°, 


1 
DNEGR= —-: 
2a? 


After this, he claimed that the method would be the same in the remaining 
cases. 

However, Fermat may have faced a technical difficult while trying to 
use the same method to perform the quadrature of the firs parabola, which 
we denote by y? = x, shown below,°° over the finit interval CB. We shall 


C D 
H 
M 
N 
E 
B A 


perform this quadrature in current notation first and this will show the nature 
of the conjectured difficult. The points B, E, N, M, ... were originally 


chosen such that 
BC EC NC 


EC NC MC 
and then we can denote them by a, ar, ar*,ar*,... , where rr < 1 is very close 
to 1. Then the vertical sides of the circumscribed rectangles with diagonals 


AB,IN,OM,GH,... are 


wey 


a(l—r), ar(l—r), ar*(1 —r), arv(1 —r), 


ae) 


30 Tt was not uncommon at that time to draw the x-axis vertically with the positive direction 
downward. The letters Y and V were not printed in this figur in Varia opera. 
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and their horizontal sides BA, EI, NO, MG,... have lengths 
er, ar Tar GV. ahd 
so that the rectangles have areas 
er d=r), Gary la Gry a=9), Gry "d=; 


These form a geometric progression with ratio r°/*, whose sum, according to 
Viéte’s formula, is 

a/2(1 —r) 

aE 


It may not be immediately clear what results from adequating r to 1, other 
than 0/0, and this may have prompted Fermat to alter his method, for he did 
introduce a modification as we shall see below. However, a little algebra 
shows that 
l-—r 1 
372 — 
pl/2 4. 


l-—r 
14+r'/2 
Then, either letting r — 1 or adequating r to | in the denominator yields 3/2. 
Therefore, the area under the parabola y = ./x from 0 to a is 

Pee 

ARCB = —~. 

3/2 
Since the area of the rectangle ABCD is a./a = a*/”, putting b = a’/*, we 
also obtain the formula 


ARCD = a?” = = 


Fermat, ifhe had any difficult atall, was able to deal with it by an ingenious 
second subdivision of the horizontal axis [p. 47; 261]: 


. 1f we take the mean proportional CVbetween BC and CE, and between 
EC and NC the mean proportional YC, ... 


This means that the points V, Y,... are chosen so that 


BC CV CE YC 
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and, in particular, 
BC. By . BC 
EC BCxCE VC? 
Since the definin property of the parabola gives 


AB? _ BC 


IE2 CE 


it follows that 
AB BC CE 


TE VC YC 
This is one of the factors in the rectangle ratio 
AE AB BE 


IN IE EN’ 


and Fermat asserted that the remaining factor equals, “as has been shown 
above, BC/CE.” It had not been shown above, but he may have had in mind 


subtracting | from each side of the original identity 


BC EC 

EC NC 
and simplifying, which leads to 

BE EN 

EC NC 


Then 
BE EC BC 


EN NC CE 
and the rectangle ratio becomes 
AE CE BC _ BC 
IN YC CE YC 


Taking it as clear that this is also the ratio of the remaining rectangles, he 


proceeded to add up their areas [p. 47; 262]: 


and consequently from the theorem that substantiates our method, 


the parallelogram AE, is to the figur /RCHE, as the segment BY, is to the 


segment YC,3" therefore the same parallelogram AE, is to the total figur 
AIGRCEB, as the segment BY, is to the total diameter BC ... 


31 This segment appears as BC in Varia opera but was changed to YC in Euvres 
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But this is going a bit too fast, so we should add some explanation. The 
“substantiating” theorem, stated here on page 251, gives 
AE —IN AE 
IN arealRCHE 


and then using the rectangle ratio, the left-hand side simplifie to 


BC ,_BC-YC _ BY 
YC ~ YO YC 


When the rectangle ratio is very close to 1, then YC © BC and the areas of the 
figure IRCHE and AJGRCB ate approximately the same. Thus, the result of 
Fermat’s addition can be written as 


AE _ BY _ ABXBY _ ABxBY 
area AIGRCB BC ABxBC. BD 
and then 
BD _ ABXBY _ BY 4 
ARCB AE BE 


But because of the fin subdivision of BC, the segments BV, VE, EY are 
adequal, and then the right-hand side above is approximately equal to 3/2. 
Fermat concluded [p. 48; 263]: 


therefore the parallelogram BD, is to the figur as 3 is to 2. 


That is, 
BD 


ARCB = ; 
3/2 


and the quadrature has been completed. 

Fermat stated that his method applies to all the other parabolas without 
exception, and to leave no room for doubt [p. 48; 263], he showed in full 
the case of the parabola y? = x”. Now he needed two auxiliary subdivisions 
rather than one and was able to prove that [p. 49; 265] 


in this case the parallelogram BD, is to the figur as 5 is to 3. 


32 Tn Fermat’s own words [p. 47; 262]: “ut BY, ad BE, ita parallelogrammum BD, ad 
figura ARCB;” but an error in Varia opera has this figur as AROB. 
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That is, 


BD 
ARCB = —.- 
5/3 


This allowed him to deduce and state the following general rule: 
It is certainly clear that the parallelogram BD is always to the figur AJCB, 
as the sum of the exponents of the powers of the ordinate & the abscissa is 
to the exponent of the power of the ordinate .. . 

Thus, in the general case of y” = x’”, 


BD 
mtn 


n 


AICB = 


If we denote the abscissa of B by a and defin g = m/n, then the area of the 
rectangle BD is a-a"/" = a?*', and the preceding formula can be rewritten 
in current terminology as 


a qit! 
xl dx = , =], 
I qt+l a7 

a general formula that Fermat found before Newton. Particular cases for a 
few integral values of g were known to others before Fermat. 

Having stated a general rule for the quadrature of the parabolas must have 
inspired him to give one for hyperbolas, which he did next as follows [p. 49; 
266]: 


In fact in any hyperbola it is always [the case], if you go back to the firs 
figur [our page 249], that the parallelogram BG, is to the infinitel prolonged 
figure RG ED, as the difference of the exponents of the powers of the ordinate 
& the abscissa is to the exponent of the power of the ordinate. 


This means that for the general hyperbola x” y"” = 1 (reversing the order in 
Fermat’s “the ordinate & the abscissa’), 
BG 


m—n 


RGED= 


n 


Thus, if we defin a = AG and write g = m/n, then the area of BG is 
a-a“"/") — q'~4, Then, the preceding formula can be rewritten in current 


terminology as 
lee) qi-4 
/ x tax = > G#l. 
a q— 1 
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5.4 GREGORY’S CONTRIBUTIONS 


Gregory’s expertise in differentiation has already been shown in Chapter 4, 
where it was made clear that he was in possession of the chain rule and that 
he discovered Taylor’s theorem. A few years before, Gregory had visited 
Italy, and at the end of his stay there, in 1668, he published a book entitled 
Geometric pars vniversalis (The universal part of geometry),°° a work that 


GEOMETRIZ 
PARS VNIVERSALIS; 
Inferviens 
Quantitatum Curvarum tranfinurationi & menfurz. 
AVTHORE 
IACOBO GREGORIO 


ABREDONENS!I 
scoT 0. 


PATAVII, MDCLXVIII, 


Typis Heredum Pauli Frambotti, Saperiorum Perro 
CVM PRIVILEGIO. 


has been described as “the firs attempt to write a systematic text-book on 
what we should call the calculus.” * It is written in a geometrical style, as the 
title makes clear, and mostly in narrative form as was common at that time. 
It contains both original material and well-known results, and Gregory, in the 


33 Geometrie pars vniversalis, inserviens quantitatum curvarum transmutationi & men- 
sure, 1668. My discussion of this text here is based on and large portions taken from my 
paper “James Gregory’s calculus in the Geometriz pars universalis,” 2007. 

34 By Adolf Prag in “On James Gregory’s Geometrie pars universalis,” 1939. Prag also 
describes the contents of the book as follows: Propositions 1 to 11 are on arc length, area 
and tangent; Propositions 12 to 18 on involution and evolution; Propositions 19 to 45 on 
solids of revolution; and the rest applies to special figures 
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second page of the introduction, left it to the reader to judge which was which 
(que mea sunt & que aliena iudicet lector). 

There is very little on how to fin tangents in this book, just one example 
given as Proposition 7 [pp. 20—22], to fin the tangent to a curve that he firs 
described in words, with reference to the next figure as follows: 


A 


K Cc 


Let the curve BHC be a hyperbola, whose diameter is the straight line AK 
& the ordinates EH, KC, are of such nature that the solid from the square of 
BE times AE is to the solid from the square of BK times AK as the cube of 
EH is to the cube of KC. 


In other words, if A and B are fi ed, if B is the origin of coordinates, and if 
BE is an arbitrary abscissa with corresponding ordinate EH, then BE*x AE 
is a constant times E H? regardless of the position of the point H on the curve. 
Then Gregory set AB = a and BE = b, so that AE = a + b, and denoted the 
constant that we just mentioned by a?/c?, where c is another constant. This 
allowed him to give the equation of the curve as 


3 
Vab23 + b3c3 35 


a 


ro 3 ie 


The problem is to fin the subtangent FE, which Gregory denoted by z, for 
the tangent FH at H. To do this, he drew a new ordinate DG at a vanishingly 


35 Gregory, who omitted the equal sign, wrote /C for cube root. But notice that he 
wrote b* instead of bb. In this respect, he was way ahead of his time. 
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small distance o from EH (“DE nihil seu serum o’’), and then the product 


af ab?c3 + b3c3 


DFXEH = (z—0) 


a 
is approximately equal to the area of the rectangle with sides EF = z and 
DG. The length of DG is obtained from the equation of the curve after replac- 
ing b, which is the variable, with b — o, and then DFxEH ~ EFXDG, an 
approximation whose right-hand side is equal to 


3 
Vab22 — 23ab30 + Gao? + 32b323 — 303b2z30 + 3b 02 — 232303 


a 


Gregory then continued: 


& removing the denominators because they are equal, also & cubing both 
terms [sides] of the equation & removing from both the equal [terms] this 
equation results 
3b? azo’ — 3b’c'az’0 — b’ cao’ — 3b?c?z"0 + 3b°c*z0° — bc 0° 
= azo’ — 2c abz0 — 3c3b’z'0 + 3¢°bz0" — c*z30°, 
& dividing all by o 
3b°azo — 3B’ Caz — bead’? — 3b 2 + 3b°c8z0 — bce" 
= &azo— 2c ab? — 3b? + 30b20 — 8230", 
& rejecting any quantities [terms] in which we fin o or its powers [which is 
permissible as an approximation if 0 is very small], it remains 
—3bh’ az — 3b? = — 2 abe — 3c3b? 23, 
& whereby adding the defect [multiplying by —1] & dividing all by c*bz? 
the equation is 3ba + 3b? = 2az + 3bz, & therefore 
_ 3ba+ 3b° 
oe G2 + 3b. 


namely the straight line [segment] E F, which was to be found.°® 


36 Since he invited the reader to judge on his originality, we must say that this is strongly 
reminiscent of Fermat’s method. Jean de Beaugrand, a friend of Fermat and admirer of his 
work, made it his own business to disseminate Fermat’s method of tangents, so maligned 
by Descartes. But he introduced some variations, one of which was the replacement of 
Fermat’s E with o. This can be seen in his quadrature of the ellipse in the 1638 manuscript 
“De la maniére de trouver les tangentes des lignes courbes par l’algébre et des imperfections 
de celle du S. des C[artes].” In Fermat, Guvres, V, Supplément, 1922, pp. 102-113. The 
letter o is introduced in page 102. Surprisingly, for an admirer, Beaugrand did not say that 
the method of tangents he applied in this manuscript was due to Fermat. In any event, it 
was widely distributed, and, since Beaugrand traveled extensively in Italy, it is quite likely 
that Gregory saw a copy and adopted the symbol o, along with Fermat’s method. 


SECTION 5.4 GREGORY’S CONTRIBUTIONS 261 


It is easy to verify this result using current methods.°” 

There is no more on tangents per se in Gregory’s Geometric, but about 
two years later he gave the rules for findin subtangents of composite curves. 
They are in manuscript form on a piece of scrap paper. He gave six rules, of 
which we reproduce three.°° 


Let Ch, AK be two curves [see the next figure] and MB be a straight line, 
and let G/ be a curve with the property that GB is always equal to [the sum 
of] AB, CB; let the straight lines CD, AF touch the curves Ch, AK; put 
CB =a, DB =b, AB =c, FB = d; let 


adb+cdb 


and then join GH, which will touch the curve G/. 


FA D NB 


This find the subtangent BH at an arbitrary point G of a curve G/, which is 
the sum of Ch and AK, in terms of the subtangents b = DB andd = FB. In 
the seventeenth century mathematicians were keener on subtangents than on 
slopes, as we have already seen, but this result is not particularly appealing or 
easy to remember. 


37 Tt may be simpler to write x instead of b and y instead of EH, then fin y’, plug in 
x = b, and note that z = y/y’. 

38 The complete set in Latin, translated into English by Turnbull and with an explanatory 
note, can be seen in James Gregory tercentenary memorial volume, 1939, pp. 347-349. 
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Today we are interested in slopes—there is a very good reason for it to be 
discovered presently—and we can fin that of GH from those of CD and AF 
using Gregory’s formula. To do this denote GB, which Gregory did not name, 
by y, and then y = GB = CB+ AB =a-+c. The point B represents an 
arbitrary abscissa, and slopes and subtangents are related for our three curves 
by the equations (if none of the tangents are vertical) 

, GB y CB a AB c 


y =——=—, b=DB =—, and d=FB=—=-—. 
BH BH a’ a’ c! c! 
When the values of BH, b, and d obtained from these equations are substituted 
into Gregory’s formula (for which he did not provide a proof in this note to 


himself), it becomes 


ca ca a+c 
a Cc 
_ cla’ ca cla atc _ y 
f° Se, 1, o  ga@ gee 
c! / ay oh ome 
Cc a 


and then y’ = a’ +c’. This is the derivative of a sum. It is the sum of the 
derivatives, while, for subtangents, BH ~ b+d. This is why slopes are more 
convenient than subtangents in calculation. 

The next quotation will show that he dealt with products and quotients. 


But if GB/AB = CB/OB,°9 and if BH =a, BF =b, BD =c, itwill be 
acb 

BN = ————_- 
ab + ac —cb 
It may be annoying, but Gregory did change the notation, so that a, b, and c 
have new meanings now. If the four functions represented in the preceding 
figur are now denoted (from bottom to top) by y, wu, v, and w, the relationship 
between slopes and subtangents is as follows: 


y u v w 
y= > ioe eee and yw = —. 
BN c b a 
Then Gregory’s formula for the subtangent becomes 
Wu UD 
me wu 
y’ ue ve uv u’wv + v/'wu — w'uv 
wiv!’ ow 


39 Actually, Gregory wrote this as a proportion, that is, as GB: AB :: CB: OB. 
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and then, since GB - OB = CB. AB can be written as wy = uv, 


,  wu'v+ov'u)—uvw! — w(u'v + v'u) — uv’ 
= Ta = 2 : 
‘ai w 


y 


This is exactly the result that we would obtain today using the quotient and 
product rules to differentiate y = uv/w. But of course, this is a modern 
interpretation. 

After setting FB = d and HB = e, the last of Gregory’s statements, which 
needs interpretation, is this: 


Truly generally, if the ratio GB to CB is the ratio AB to CB multiplied [by 
itself] in the ratio m to n, in every case it will be 


mde — nde 
DB = ————_. 
me—dn 


Here m and n are just positive integers, and the stated hypothesis means that 


GB AB m/n 
om ie 


which, if we use u, v, and w as in the preceding case, becomes w/u = (wiuyr, 
Since 


Gregory’s formula for the subtangent DB becomes 


Uv Ww Vv WwW 
m—— —n—— 
ui yw! vw! (m —n)vw 
ro Ww v _ a ! 
Uu m———n MWU vwin 
! yp! 


It is easy to verify that this is the result obtained using today’s rules to dif- 
ferentiate w/u = (v/u)'"/" (or, rather, w"u’"—" = v'), so that Gregory could 
have differentiated a power with an arbitrary rational exponent if he had not 
been thinking of subtangents.*° 


40 At the end of the explanatory note to this manuscript, Turnbull, who had dated it to 
November 1670 at the earliest, states that this date “suggests that it was inspired by reading 
Barrows’s book on Geometry [see Section 5.5], which reached Gregory during August 
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Returning to the Geometric, we fin at the outset some work on arc length. 
The firs proposition is just preparatory, but in Proposition 2 [pp. 3—8] Gregory 
arrived at the result that we know today. Given an arbitrary curve 79CD, 
which he assumed to be simplex, seu non sinuosa (monotonic, and in this case 
increasing) or else that it must be split into pieces, Gregory started by selecting 
an arbitrary constant X, which he viewed as the length of a segment. Now 
let the segment 96 in the next figur “1 be normal to the curve 79CD at an 
arbitrary abscissa 3, and construct another curve PNLH such that its ordinate 
at 3 is the product of X and the quotient of the length 96 over the length 93. 
Then he stated a theorem, proved it, and concluded as follows [p. 8]: 


It is also clear that the [area of the] mixtilinear figur PNLH62 is to the 
[area of the] rectangle X times 26, as the [length of the] curve 79CD is to 
the straight line [segment] 26. 


Gregory’s X is there for reasons of geometric homogeneity, an important 
concept in his time, but if we take X = 1, his statement reduces to 


length of 7D = area PH82. 


We do not present the proof, which is long, involved, and heavily multi- 
notational, but just interpret it in more familiar terms as follows. Let the 
arbitrary abscissa at 3 be denoted by x, let the arc 79CD be the graph of a 
differentiable function f, and let the segment 9B be tangent to the curve at 9. 
Then, keeping in mind that the numbers here are just labels for points and 


1670, and which contained a systematic account of such processes of differentiation in a 
geometrical form.” He did not give any supporting reasons. But Gregory himself shed 
some light on this issue when, in a letter of 5 September 1670, he wrote: “I have read over 
both Mr Barrows bookes of Lectures, with much pleasure and attention, wherein I fin 
him to have infinitel transcended all that ever writt before him, I have discovered from 
his method of drawing Tangents, togeather with some of my owne, a generall Geometrical] 
method without Calculation of drawing tangents to all Curves, comprehending not only 
Mr Barrows particular methods but also his generall Analyticall method in the end of the 
10th Lecture, my method contains not above 12 Propositions.” So Gregory learned from 
Barrow but had additional methods of his own. The differentiation of rational powers in 
Gregory precedes the Barrow book, a book that does not contain the differentiation of a sum 
or difference, in spite of some exaggerated twentieth-century claims on this issue. More on 
this in the next section. 

41 This is just an extract from Gregory’s own figure The same is true of all but the next of 
the figure from the Geometria reproduced here. Only those features are reproduced that 
are necessary to illustrate the statements made in this presentation. Additional lines, points, 
and letters used in omitted proofs have not been included. 
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using the similarity of triangles 936 and 9OB, we see that with X = | the 
ordinate of PNLH at x is 


96 _ JOR +36 ft ee) 
93 93 YY * \93} — 90 


The last quotient on the right is what we now call f’(x), and then, using our 
definit integration for area, Gregory’s discovery can be expressed in today’s 


familiar form as 
5 
length of 7D = i Jl+(f". 
2 


The most notable result involving area and arc length in the Geometrie is 
Proposition 6 [pp. 17-19]. It is in the form of a problem to be solved, and 
it can be interpreted, rather than purely translated, with reference to the next 
figure in which angles that appear right are indeed right angles. The problem 
is this: for any given curve BNS, fin acurve AKQ such that, for any point J 
on its “axis” AO, the length of the arc AK is to that of the segment A/ as the 
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O POR s 


area ABNI bound by BNS and the axis is to the inscribed rectangle A BMI.*2 
To solve the problem, Gregory assumed that the curve BNS is simplex seu 
non sinuosa, and then constructed two curves in the following manner: 


Henceforth let the curve AFLP be of such nature, that, if any straight line 
[segment] /N is drawn perpendicular to the straight line A O, the curve A FL P 
cutting it at L, the square of the straight line [segment] JN is equal to those of 
IL, IM both; then draw the curve AEK Q of such nature, that, if any straight 
line [segment] /M is drawn perpendicular to the straight line AO, & cutting 
the curve AEK Q at K & AFLP at L, the rectangle MIK 43 [that is, the area 
MIx1K_} is equal to that of the mixtilinear [figure JA FL. 


It would be convenient to interpret this in present-day notation before going 
ahead. Choose A as the origin of coordinates; let the x-axis contain the segment 
AO with its positive direction downward; let the y-axis contain the segment 
AB with the positive direction to the right; let the curves BNS, AFLP, and 
AEKQ be the graphs of functions that we denote by f, g, and h; and denote 
the length of AB by a. We assume, as in Gregory’s figure that f is a positive 
increasing function with a negative second derivative. Then Gregory’s curves 


42 The Latin original is this: “Invenire curvam [A K Q], que ad suam axem [A/O] eandem 
habeat rationem, quam figur queelibet exhibita [A BNSO] ad rectangulum sibi inscriptum 
[ABRO].” 


43 The original says MIL, but this is just a typo that is not repeated in the rest of the proof. 
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are constructed so that 
IN? =I1L? + 1M’, or f?=g*+4+a’, 
and, if the abscissa of J is x, 


x 
MIxIK =arealAFL, or ah(x) =} g. 
0 


Once these curves were constructed, Gregory stated the solution to the 
problem in the following terms: 


I say that [the area of] the figur ABSO is to [that of] the rectangle ABRO 
as [the length of] the curve AEK Q is to [that of] the straight line [segment] 
AO. 


With today’s terminology this can be proved rather quickly, and not just from 
A to O but also from A to an arbitrary point / in AO. Indeed, the area of the 


figur ABNI is 
: 
fs 
0 


while the length of the arc AEK is 


i i 2 % 2 2 id 

i VIF ary = f ji+5= | jit — --f f 
0 0 a 0 a a Jo 
Dividing both sides by AJ and putting 7 = O proves Gregory’s assertion. But 
this proof is short and simple only because we have used the full machinery of 
the calculus as it is known today, including the formula for arc length and the 
fundamental theorem of the integral calculus to differentiate h. However, this 
machinery was not available to Gregory, who had to create his own results as 
he went. As we have seen, he had already obtained the formula for arc length 
in Proposition 2, and the fundamental theorem of calculus is precisely what 
the proof of Proposition 6 contains in disguise. 

By the use of today’s fractions and mathematical symbols, Gregory’s proof 
[pp. 17-19] can be both transcribed and interpreted as follows. Assuming the 
curve AEK@Q to have been constructed, choose a point C in the segment AO, 
whose abscissa we denote by xo, such that 


IL IK g(x) h(x) 
=> ’ or a — i | 


IM IC a x — x0 
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and draw the segment KC. The heart of the proof lies in showing that this 
segment is tangent to the curve AEK@Q at K, or, since this curve is the graph 
of the function h, that 

h(x) 


x — X09 


h' (x) = 


In view of the previous equation, this is equivalent to h’(x) = g(x)/a. Re- 
calling the definitio of h, we observe that this is the fundamental theorem of 
calculus. 

To prove it, Gregory showed firs that KC cannot intersect DH at a point a 
to the right of EZ, as shown in the figure To see this, we start by drawing the 
rectangle JLZC, which intersects DH at X. The equation definin C above 
can be rewritten as 


MIxIK =ILxIC, or ah(x) = g(x)(x — x0). 


If we denote the abscissa of D by x; and note that the construction of the curve 
AEK@Q also gives 


xX] 
GDxDE = area DAF, or ah(x}) = g, 
0 


then we have 


IK GDxIK MIxIK arealAFL 


DE  GDxDE areaDAF | area DAF 


Since DE < Da, using similarity of triangles and the integral representations 
of ah(x) and ah(x)) already stated, we can write 


x 
arealAFL IK I¢ area [Z [ g (x — x0) g(x) 
> => => ? or > ’ 
areaDAF Da DC areaDZ is , (x1 — xo) g(x) 
0 
which after inverting the fractions gives 
X] 
areaDAF  areaDZ i 5 (1 — x0) g(x) 


< ’ x < 
arealAFL  arealZ i ‘ (x — x0) g(x) 
0 
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Next, multiplying this inequality by —1, adding | to both sides, and simplifying 
yields 


Xx 
arealTDFL ~~ arealX [ _ (x — x1) g(x) 
> ’ 7 = = 
arealIAFL arealZ [ : (x — xo) g(x) 
0 


The denominators of the two fractions in each of these inequalities are the 
same (once more, by the construction of the curve AEKQ and the definitio 
of C, or by the definitio of / and the stated equation ah(x) = g(x)(x — x0)), 
so we conclude that 


x 
area! DFL > arealXx, or / g>(x—x1)g(%), 
x] 
which is absurd.“4 
Similarly, Gregory showed that KC cannot intersect the curve AEKQ on 
the arc K Q. Therefore, KC is tangent to AEK Q. As we have remarked above, 
this is a proof of the fundamental theorem of calculus, although it is more than 


likely that Gregory was not aware of this general relationship between what 
we now call differentiation and integration. 


44 To the best of my knowledge, the Geometric has not been either reprinted or translated 
since 1668, so here isa direct translation of the part of the proof transcribed above: “Let K be 
any point whatever on the curve A FL P, through which draw a straight line JN perpendicular 
to the straight line AO & cutting the curves AFLP, BR, BHNS, at points L, M, N; so that 
as IL is to IM so is IK to IC & draw KC: the straight line KC cuts or is tangent to the 
curve AQ at the point K; if it happens that it cuts in K & therefore falls inside the curve 
and actually inside the point E towards the vertex A: draw through the point E the straight 
line DH parallel to the straight line /N, cutting the curves AQ, AP, BR, BS at the points 
E, F, G, H, & the straight line LC [this is a typo; he means KC] in a, also complete the 
rectangle JL ZC, whose side LZ cuts the straight line DH at X. Seeing that /L is to 1M 
as IK is to IC, it will be that the rectangle M/K or the mixtilinear figur [just mixtilineum 
in Gregory, and henceforth just ‘figure here] JA FL is equal to the rectangle JZ; & because 
the rectangle GDE is equal to the figur DAF, it will be that as 1K is to DE so is the 
figur JAFL to the figur DAF, but /K has a larger ratio to DE than to Da; & therefore 
the figur JAFL has a larger ratio to the figur DAF than /K has to Da or IC to DC; 
& consequently the figur AFL has a larger ratio to the figur DAF than the rectangle 
IZ to the rectangle DZ, & by conversion the ratio of the figur 7AFL to the figur /DFL 
is smaller than the ratio of the rectangle /Z to the rectangle [X, & exchanging, the figur 
IAFL has a smaller ratio to the rectangle 7Z than the figur /DFL to the rectangle /X, 
whereas the rectangle /Z is equal to the figur JA FL, it will follow that the rectangle X is 
smaller than the figur /DFL, but it is larger, which is absurd.” 
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Once this was established, he drew KT perpendicular to KC, and, by 
similarity of triangles, 
CK KT 
Cr IK 
The segments IN, IM, and IL also make up a right triangle (since JN? = 
IL* + IM’), which is similar to the triangle C/K because of the equation 


IL/IM =1K/IC used to defin C. It follows that 
IN CK OKT xe 


IM cI IK 
Without further explanations, Gregory concluded by saying that 


since this can be done in the same manner at any point of the curve AQ, it is 
clear from the above [Proposition] 2 that the straight line AO is to the curve 
AQ as the rectangle OB is to the figur ABSO, which was to be proved. 


But a word of explanation is preferable. The conclusion of Proposition 2 on 
arc length, after replacing PNLH62 with BNIA, X with 1M, 26 with AJ, and 
the curve 79CD with the curve AEK, shows that 
area BNIA length of AEK 
IMxAI Al 
Al IMxAI area 1B 
length of AEK area ABNI area ABNI 

Gregory’s conclusion follows by taking J = O, K = Q,andN = S. 

Gregory did not have what we now call the fundamental theorem of calculus 
when he started his proof of Proposition 6, but he got it and he used it when 
he needed it, and he was the firs to do so. 

Proposition 11 may be another first because, when generously interpreted 
in today’s terms, it happens to be the firs use of the method of substitution to 
evaluate an integral. Gregory’s statement is as follows [p. 27]: 


whence 


PROP. 11. THEOREM. 


Consider any mixtilinear space ABKI comprised between a curve BK, a 
straight line AI & two parallel straight lines BA, KI; and let the curve MY 


45 For the reason stated before, here is a direct translation of what Gregory actually wrote: 
“Let the perpendicular KT to the straight line CK meet the straight line AO at T; it is clear 
that CJ isto CK as IK isto KT; but on the other hand C/ isto CK as M1 is to NIJ, because 
the straight lines /N, 1M, IL, make a right triangle similar to the triangle C/K, whose sides 
IM, IN, are homologous to the sides CJ, CK; & hence as IK is to KT so is IM to IN;” 
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be of such nature that (having taken any point C on the curve BK & from 


A 


Y 


this drawn a straight line CE parallel to the straight line AB, and the straight 
line CZ tangent to the curve BK and ending on the straight line AI, if ex- 
tended to Z) the segment EZ is always equal to the segment CS parallel to 
the straight line AZ & ending on the curve YM. I say that the figure BK MY, 
comprised between the curves BK, MY, & the straight lines BY, K M, parallel 
to the straight line AZ, is equal to the mixtilinear figure BAIK. 


Gregory gave a geometric proof, based on the use of Proposition 10, that 
we do not reproduce. What Proposition 11 does is to reduce the computation 
of the area between the curve BCK and the segment AE/ to the computation 
of the area between the curves BCK and MSY. To express this in present-day 
notation, consider the particular case in which the segments BA, CE, and KI 
are perpendicular to AZ. Then choose coordinate axes with origin at J, the 
positive x-axis along JA and the positive y-axis along /K. Let the curve BCK 
be the graph of a differentiable and increasing function f whose inverse is 
denoted by g, denote the abscissa of A by a, and let the coordinates of an 
arbitrary point C on the curve be (x, y). Then 


area ABKI = [ f(x) dx. 
0 
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Now, the area between the curves BCK and MSY is the same as the area under 
a curve whose ordinate at each arbitrary point y from f (0) to f(a) is CS. Since 


oy = £5. 5 
a Fa) FeO). 


it follows that, under appropriate hypotheses, 


f(a) 
area BK MY = / 


f(a) 
CSdy = i Flee") dy. 
f(0) f(O) 


Thus Gregory’s proposition, that area ABKI = area BK MY, can be rewritten 
as 


f 
f(O) 
This is integration by a change of variable. 
Propositions 7 and 11 were not to remain dead ends. Gregory used both in 
proving the following important result [pp. 102—103], which we shall interpret 
after the proof: 


a f(a) 
[ Aan : EOE Ody. 


PROP. 54. THEOREM. 


Consider the parallelogram ABIK, and let the curve ACI be of such nature, 
that (having drawn any straight line [segment] DE parallel & equal to the 
straight line [segment] AB cutting the curve at C) the ratio AB to EC is the 
product of the ratio AK to AE [by itself] in the ratio of P to Q. I say that [the 
area of | the parallelogram ABIK is to [that of | the figure ACIK as P + Qis 


to Q. 


The statement about the ratio AB to EC means that 


AB i 
EC \AE 


where P and @Q are just positive integers. 

To prove the proposition, Gregory drew the tangent to ACJ at C until it 
intersected the extension of AE at F. Then he drew the segment CH parallel 
and equal in length to EF. If this is done for every arbitrary point C on ACI, a 
new curve AHG results. Then Gregory wrote: “‘it is clear from [Proposition] 7 
above that the straight line [segment] AE is to FE, or LC is to HC as P is 
to Q,” which we shall accept with the meaning that he had figure this out, 
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in one of his famous pieces of scrap paper, by the same method used in the 
example of Proposition 7. But if this ratio is valid for an arbitrary C on ACI, 
then 


it is evident that the figur AC/JB is to the figur ACIGH as P is to Q; 
but from [Proposition] 11 above the figur ACIGH is equal to the figur 
ACIK & therefore the figur AC/JB is to the figur ACIK as P is to Q, & 
componendo the parallelogram ABIK is to the figur ACIK as P + Q is 
to Q quod demonstrandum erat. 


The following is a transcription of this statement into more modern language: 


area ACI B P 
areaACIGH | OQ 


from which it follows by Proposition 11 (with ACJ in the role of BCK and the 
points F and H in place of Z and S) that area ACIGH = area ACI K, with 


the result that 
area ACI B P. 


area ACIK Q , 
Adding | to both sides and simplifying, we arrive at 


areaABIK  P+Q 
area ACIK Q 


which was to be proved. 
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To interpret this proposition in current terminology, assume that the par- 
allelogram ABIK is a rectangle (it is shown as such in Gregory’s own figure) 
and choose coordinate axes with origin at A, the positive x-axis along AK and 
the positive y-axis along AB. Let ACI be the graph of a differentiable func- 
tion f,andlet AE = x, EC = y= f(x), AK =a,and KI = AB = b. Then, 
if we use lowercase letters for the integers P and Q, Gregory’s hypothesis on 
the ratio AB/EC becomes 

b ( a ) P/q 
fix) \x 


so the equation of the curve ACI is 


P/4 
f(x) = »(*) F 
a 


a b a 
area ACI K =i f= ij xP/4 dx, 
0 aPld Jo 


while area ABIK = ab. Gregory’s proposition, rewritten as 


Then 


area ACI K = a area ABI K, 
Pp+q 


then translates to 
P+ 


[stax= ll 
0 Puy 


q 
after some obvious simplification Thus, what Gregory effectively did was to 
integrate a rational power of x. 
However, Fermat had already shown this. What was new with Gregory is 
the statement he made in his proof about the use of Proposition 7, enclosed in 
quotation marks above. It can be restated as 


x LC D 


FE HC q 


so that FE = qx/p. But f’(x) = CE/FE = f (x)/FE, and then 


P/q | 
fw=tfes= ?»(=) = 24(=) 1 
qx qx \a q \a a 
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which amounts to differentiating a rational power of x. 

It may be a matter of wonder to us, a painful experience too, that these 
familiar results were obtained by the painstaking application of classical geo- 
metry. But in the absence of the now familiar concept of limit, at a time when 
what we know now had yet to be discovered, how else could anyone deal with 
problems about tangents and areas but by resorting to the well-founded Eu- 
clidean geometry? There will be an answer to this question, but not quite yet. 

Without some necessary algebrization, Gregory’s published results lack 
the agility and the power to be called a calculus. But we should remember 
those higher-order “derivatives” that he computed to obtain the infinit series 
expansions described in Chapter 4. They suggest that he may have created 
such a calculus, but, if so, it never saw the light through timely publication. 


5.5 BARROW’S GEOMETRIC CALCULUS 


In 1669 Isaac Barrow (1630-1677), the firs Lucasian Professor of Mathema- 
tics at Trinity College, Cambridge, resigned his position and recommended one 
of his former students, the 27-year-old Isaac Newton, as his successor. With 
Newton’s assistance, Barrow had prepared the publication of his mathematical 
swan song, the Lectiones geometrice, which appeared the following year.*® 

To anyone perusing this volume for the firs time it might appear as a 
collection of theorems on geometry, perhaps not a particularly interesting 
one at that.4” But a closer examination reveals that quite a number of the 
propositions are about tangents, quadratures, and related topics. Cloaked in 
very thick geometric garb, practically unrecognizable, these are well-known 
results of the calculus. The firs of these in the Lectiones that is significan is 
the following proposition from Lecture VIII [p. 64; 91]: 


V. Let VET be a straight line,48 and let two curves YFN, ZG O be related 
to one another, so that if any straight line E FG is parallel to a straight line 


46 Lectiones geometrice; in quibus (preesertim) generalia curvarum linearum symptomata 
declarantur, 1670; 2nd. ed. 1674. Abridged translation by Child as The geometrical lectures 
of Isaac Barrow, 1916. Page references are to the 1674 edition and to Child’s translation, in 
this order and separated by a semicolon. The translations included here are modification 
of those by Child, except for the one of Examp. I, which is my own. 

47 Child, the translator, in conveying his own firs impression on page 28 of the Introduc- 
tion, stated that the results presented by Barrow “seemed to be more or less a haphazard 
grouping, in which one proposition did occasionally lead to another; but certain of the more 
difficul constructions were apparently without any hint from the preceding propositions.” 

48 Child’s translation has TE/ and V is not shown in his figure 
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IsAAC BARROW 


Portrait by Isaac Whood 
Engraving by Benjamin Holl 


AB [not shown] given in position, the intercepts EG, EF are always in the 
same ratio to one another; while the straight line TG touches one curve ZGO 


L G T 


in G (and comes together with the straight line VE at 7) then TF, extended, 
will also touch the other curve YFN. 


In short, if the quotient EG/EF is constant regardless of the position of the 
line EFG, as long as it is parallel to AB, and if TG is tangent to ZGO at G, 
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LECTION ES 
Geometrice ; 


In quibus (prxfertim) 
Genenatia Curvarum Lincarum Syme touarta 
DECLARANTHUR. 


Au@ore Isaaco Barrow Collegii 
SS. Trinilatis id Acad. Cantab. Socio, & Pease Kee! 
sia Sodale. 


Oj quem Aeyrmar vie walla rd padipala, ot TxD vinsin, tw gai- 
vera’ in Begdiig » af tv rity mutdscn % yoprdowrra, nds} 
pais dre seduddew, tos dey 13 iEvrigg merei a'vrar ylydaa 
mins FrDM aay, Plato de Repub. VII. 


LONDINI, 
Typis Gulelmi Godbid , 8 proftant venales apud 
Johauacm Dunmore, & Oftavianens Pulleyn Juniorem, 
M.D (LX X, 


then 7F is tangent to YFN at F. Barrow gave a simple geometric proof of this 
fact, but it is not the proof that is of interest to us.*° 

What matters is what this means, so rotate the figur so that the line VE/ 
is horizontal and take it to be the x-axis with the positive direction to the left. 
Now consider the particular case in which the line AB is vertical and take it 
to be the y-axis with the positive direction upward. If the curves ZOG and 
YFN are denoted, in function notation, by y = f(x) and y = g(x), then the 
hypothesis that EG/E F equals some constant k means that f = kg, and then 


ya ag 
Ie TE TE aN 


for any x. Thus, in our language, Barrow has shown that if f = kg then 
f’ = kg’, which is one of the theorems of the calculus.°° 


48 Since it depends on a previous proposition it would take a little too much space to 
reproduce here. But those readers who would check with the original sources should be 
aware of the fact that both Barrow’s original and Child’s translation contain typos (not the 
same), but they are easy to identify and correct. 

50 This is one of the “processes of differentiation in a geometrical form” that Turnbull 
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Here is another theorem from Lecture VIII that has the distinction of being 
both nontrivial and easy to explain [pp. 65; 93-94]: 


IX. Let VD be a straight line, and X EM, YFN two curves so related that, 
if any straight line EDF is freely drawn parallel to a line given in position, 
the rectangle from DE, DF is always equal to any the same space [area]; 


also the straight line ET touches the curve X EM at E, and concurs with VD 
in T; then, if we take DS = DT; & FS is joined; [FS] will touch the curve 
YFN at F. 


The hypothesis is that the curves are related in such a manner that the product 
DExDF is constant regardless of the location of the line E DF. In other words, 
if JN is any line drawn parallel to EF and if it intersects the curves at M and N, 
then DEx DF = PMx PN. The complete proof is as follows [pp. 65-66; 94]: 


thought might have inspired Gregory. This is quite possible, but Child’s assertion on 
page 31 of the Introduction to his translation of Barrow’s Lectiones, to the effect that 
this proposition gives the derivative of a sum, seems to be an exaggeration. At the end of 
the translation of Lecture VII there is a note by Child in which he admitted that Barrow 
omitted a theorem, then Child himself stated and proved that theorem in the style of Barrow 
[p. 99], and this is the equivalent of the differentiation of a sum. Then he expressed his 
belief that [p. 100] “the omission of the theorem was intentional;” stating that “Barrow may 
have thought it evident, ... but I prefer to think that he considered it as a corollary of the 
theorem of §V;” that is, the one just presented. Then Child showed how such a corollary can 
be obtained, but since there is no figur and it is not clear what points are represented by his 
letters a and b, I have been unable to understand it. In any event, it is clear that Child was 
passionate about Barrow’s work, and eager to justify the opening assertion of his preface, 
that [p. xiii] “Isaac Barrow was the firs inventor of the Infinitesima Calculus.” 

However, after the work of Viéte, Descartes, and others, mathematics had already de- 
veloped as a fertile soil for the ideas of the calculus. It is not surprising that they were 
developed by several men of genius almost simultaneously. It is neither fair nor necessary 
to attempt to give all the credit to a single individual. 
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For [let] any [straight line] JN be drawn parallel to EF; cutting the given 
lines as shown. Then 91 
TP (= ) TD SP SD 


> — and also —_— = — 
PM PI DE PK DF 


Therefore 
TPxSP TDxSD TDxSD 


PMxPK  DEXDF  PMxPN_ 


But TDxSD > TPxSP [by the Euclidean proposition quoted here on 
page 241, since D is the midpoint of TS]; hence all the more, 


TDxSD  TDxSD 
PMxPK  PMxPN 


Therefore, 
PMxPK < PMxPN; or PK < PN. 


Thus the whole line F'S lies outside the curve YF N. 


In other words, what Barrow is trying to say is that if the curves are related as 
stated before the proof, and if ET is tangent to X EM at E, then F'S is tangent 
to YFN at F. 

What does this really mean? Is it a result that we should be excited about? 
To better explain it, we start by reconstructing the figur with some rectangular 
axes, as is the custom now, and then we shall use coordinate and function 
notation again. Take the line V D to be the x-axis with the positive direction to 
the right and let the line given in position be a vertical y-axis with the positive 
direction upward. Let the negative part of the y-axis represent a positive 
z-axis. If the curves YFN and XEM are then denoted by y = f(x) and 
Z = g(x), respectively, then the fact that they are related by DEXDF = k, 
where k is a positive constant, means that (referring to the figure y = k/z. 
The slopes of the tangents to the graphs of g and f at E and at F are, in the 
arrangement shown in the figur and using the hypothesis that TD = DS and 
current notation, 


, DE DF DF 
Z=— and y =-— =-—- 
TD DS TD 


51T am using modern fractions and symbols. Barrow wrote TP.PM C~ (TP.PI::) 
T D.DE and so on. 
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Then 
y! DF DEX DF k 
‘DE DE2 2 
so that 
/ k / 
a 
72 


Thus, what Barrow essentially did is to fin the derivative of y = k/z geo- 
metrically, a task for which we use the chain rule today. 

Immediately after the proof he considered the particular case in which the 
curve X EM isa straight line [p. 66; 94]: 


Note. If the line X EM were a straight line (and so coincident with TE /) 
Y FN would be the ordinary hyperbola, whose centre is T, one asymptote 
T S, and the other TZ parallel to EF. 


This statement may not be a model of clarity (in particular, T Z is not parallel to 
EF in Barrow’s own figure) but here is what it means in today’s terminology. 
Since no mention has been made about the location of the vertical axis, we 
can place it at will, as we did in the last figure and assume that the equation 
of the line X EM is z = mx, and then y = k/(mx), the equation of an ordinary 
hyperbola. In this case, z’ = m and 


yr a m= : 
m2 x2 mx2 


Using similar techniques, Barrow established many results in what we now 
call differentiation in Lectures VIII and IX.5* However, his heavy geometric 


52 For the complete list see Child, The geometrical lectures of Isaac Barrow, pp. 30-31, 
but take it with a grain of salt. 
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approach almost hides this fact and has left many people wondering whether 
he really knew what he was doing. Barrow himself stated at the start of Lecture 
VI that the last six lectures would be partly [p. 45; 66] 


about an investigation of tangents, without the trouble or annoyance of cal- 
culation ... partly about the ready determination of the dimensions of many 
magnitudes, with the help of designated tangents... 


Since that is what the last six lectures actually delivered, Barrow must have 
known what he was doing. 

But then Barrow diverged from his self-imposed method of strict geomet- 
ric constructions when, at the end of Lecture X [pp. 80—81; 119-120], he 
presented an alternative, and more algebraic 


method for findin tangents by Calculation commonly used by us. Although 
I hardly know, after so many well-known and well-worn methods of the kind 
above, whether there is any advantage in doing so. Yet I do so on the advice of 
a Friend;°° and all the more willingly, because it seems to me more profitabl 
and general than those which I have discussed. Accordingly, I proceed as 
follows. 


Let AP, PM be two straight lines given in position (of which PM cuts a 
given curve in M) & let MT be supposed to touch the curve at M, and to cut 


M 


A T OQ P 


the straight line at T; in order to fin the quantity of the straight line PT; I set 
off an indefinitel small arc, MN, of the curve; then I draw straight lines NQ 
parallel to MP & NR to AP;I call MP =m; PT =t; MR=a; NR =e; 


53 Tn the draft of a letter found by Louis More in the Portsmouth collection, Newton stated: 
“A paper of mine gave occasion to Dr. Barrow [who] showed me his method of tangents 
before he inserted it into his 10th Geometrical Lecture. For I am that friend which he then 
mentioned.” More, Isaac Newton. A biography, 1934; reprinted by Dover Publications, 
1962, p. 185, n. 35. 
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and the other straight lines, determined by the special nature of the curve, 
useful for the matter in hand, I also designate by name; also I compare MR, 
NR (& through them MP, PT) with one another by means of an equation 
obtained by Calculation; meantime observing the following rules. 


1. In the calculation, I omit all terms containing a power of a or e, or 
products of these (for these terms have no value). 


2. After the equation has been formed, | reject all terms consisting of let- 
ters denoting known or determined quantities; or terms which do not contain 
a or e (for these terms brought over to one side of the equation, will always 
be equal to zero). 


3. I substitute m (or MP) for a, and t (or PT) for e. Hence at length the 
quantity of PT is found. 


Rule 3 may profi from a bit of explanation. Since the “triangles” NMR and 
TMP are (approximately) similar, we can write 

m MP EP t 

a MR’ NR e 


and then whatever equation there is at the start of Rule 3 can be multiplied by 
m/a=t/e. 

There is no geometric proof of this alternative method. This is an algorithm, 
much in the manner of Fermat’s Methodus and not too dissimilar. Barrow 
was, in all likelihood, not directly acquainted with Fermat’s work because his 
name is not mentioned in the Lectiones. But he mentioned those of Gregory 
[p. 89; 131] and Huygens [p. 94; 141], both of whom had used Fermat’s 
method. So there is a distinct possibility of an indirect influenc of Fermat 
on Barrow. But Barrow’s method is an improvement over Fermat’s Methodus 
because the use of two small quantities, a and e, instead of Fermat’s E, makes 
Barrow’s procedure better suited for applications to implicit functions. Of this 
alternative method Barrow gave fi e examples, of which we present the third, 
which he labeled in the margin La Galande, that is, the folium of Descartes. It 
is possible to quote it in its entirety, with some explanatory notes in brackets 
[pp. 82—83; not in Child]: 


Examp. Il. 


Let AZ be a straight line given in position, & AX a magnitude [a given 
constant]; and also let AMO be a curve such that, when any straight line 
M P whatever is drawn normal to AZ, it will be AP cub. + PM cub. = 
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AXxAPxPM [that is, x? + y? = bxy, where b is a constant]. 

Set AX = b; & AP = f; then AO = f —e; & AO cub. = f> —3ffe 
[omitting terms that contain powers of e, according to Rule 1]; & QN cub. 
[= PR? = (m —a)*] = m> — 3mma [Rule 1]. & AOx ON = fm— fa— 
me+ae = fm— fa—me [Rule 1]; from which AXx AQxQN = bfm — 
bfa—bme; hence the equation f*—3 f fe+m>?—3mma = bf m—bfa—bme; 
or removing terms to be rejected [by Rule 2, since f? + m>? = bfm], bfa — 
3mma = 3 f fe — bme; substituting [Rule 3] bfm — 3m? = 3 f ft — bmt; or, 

bfm — 3m 


3ff—bm 


This solution can now be compared with the one given by Fermat. 


5.6 FROM TANGENTS TO QUADRATURES 


Barrow gave the following important result in Lecture X [pp. 78; 116-117], 
which, with reference to the next figure 4 can be translated as follows: 


XI. Let ZGE be any line [curve], whose axis is VD;°° to which ap- 
ply firs perpendiculars (VZ, PG, DE) starting with VZ and continually 
increasing [in length]; also let V/ F be a line such that, if any straight line 
EDF is drawn perpendicular to VD (cutting the curves at the points E, F, 


541 have omitted a portion of Barrow’s Figure 109. The curve ZGEG branches out, at 
least in the 1674 edition, about midway between Z and G. One branch goes left to Z, as 
shown, but the other goes to V. As much as is possible to judge from a reproduction, this 
omitted branch looks like a later addition to the original drawing. 

55 Child’s translation has A in place of V. Another translation can be found in Struik, A 
source book in mathematics, 1200-1800, pp. 255-256. 
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Fis 


Fig. to9. 


Fig. tice 


Fig. tr 


Dene 7X, 


Hujulmodi plura quadam cogitaram hic inferere , verium hae ex- 
iftimo fufticere fabindicando modo, juxtaquem, ciera Caleuli mslefti= 
um, curvarsm tangentes excuitere licet, unique conftructiones de- 
monifttare. Subjicutm ramen unum aur alrerum non afpernania, ut vie 
derue Theoremata perquam generalia. 


MI. SitlincaquepramZ GE, cujus axis VD, ad impri- 
amis applicita perpendiculares (V2, PG, DE) tb intloVZ tone 
Linwe orcringue crefeant , (it item linea V 1 F talis, we daca quicnng, 
retti ED Fad V Dpetpendiculari (quar cwroae fecet punttis F, ry 
ipfara V Din D) ficlemper rectangulumex DF, & delignaca qui- 
dam K aquale fpatio telpedtive intercepes VD EZ, fiat ancem DE . 
Db::R.D1T, & connectatur recta TF, hac curvam VIF 
continger, ¥ 

Sumarur enim in linea V1 F panétum quodpiam 1 Cillud primo fix 

pondium F, verfasinisiam V) & per hoc ducanturreske 1G ad 
VZ, ac K Lad VD parallel (qua lineas expolitas fecent, ut vides) 
eftqueromL F. LK: (DE. DT::) DE.R, adesque LP x 
R=LK*«DE. Bftamem (cx praftitura tinearum iftarum natura) 
LE x Raquile fpatio PDEG, ergo LKx DE=PDEG — 
Bao es. diechitd Sd tt alo 

Rarlos accipiatar quodvis punétum 1, infra p E, reliqua 
fant, uti pris , ficriiqoe jam plané difeue confabit fore L K xD 
=PDEGo-DPx« FT atning Green Lok 
quibus liquid jo paret totam reétam T K extra) curvam 
VIE J exiftere. liwdinaaY Z, PG, : 

Jifdem quoad cxtera pofitis, imate DE, ce. con- 
tinué decrefeant, eadem conelulio fimili ratiocinio colfigetur ; uni- 
cum obvenit D,/crimes, quod in hoceafr (contra quam in priore) 
linea V1 F concayas fuas axi VD obvertat. 

Corot. Notetar DE x D Tarquati {patio V D EZ. 


XIL Exindd deducitur hoc Theorems: Sint due Bree peste 
TE LE LED EA ee 
Plicata quavis » fais pes ae 
Pe peseaY D8 fanshoe sates = DE, & conneftaut FQ, 

pemaeit hs li Vie, reeF lens, is qualem 

aatur enim linea tl mox 

ws (ord vipa abn ps VEZ, 
hoc eft wt quadrata ex:applicatis a curva V K F in ptafente hypat 

mq: 


Theorem XI in the Geometrical lectures. 


and V D in D) the rectangle formed by DF, & a certain designated [length] R 
is always equal to the intercepted space VDE Z; also let. DE/DF = R/DT 


[DE.DF :: R.DT in Barrow’s own notation]; & join the straight line T F; 
then it will touch the curve V/F. 
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As usual, this needs interpretation, which we provide in three ways. First, 
we split the figur in two and fli the bottom curve vertically; second, we 
eliminate some of the clutter in the figur and in the statement of the theorem; 


A A z=F (x) 


y=f) 


x T x 


and third, we introduce some current notation as shown in the new figures 
With the aid of these, in which we have taken the liberty to replace Barrow’s D 
with x, we can restate the result as follows: 


XI. Let the curve ZGE represent an increasing function f; also let F be 
a function with graph VIF such that, for some constant R and at any x, the 
product RF (x) is equal to the area under the graph of f from V to x; and 
select T on the Vx axis such that 
XE: R 
sf OUaT 
Then TF is tangent to y = F(x) at (x, F(x)). 


Of course, some obvious conditions must be imposed on the curve. Omit- 
ting the proof, which is not our concern at this point, this means that the slope 
of the tangent to F at x is 


F(x) = xF _ XE = fx) 
xT R R 


The conclusion to be drawn from this is that, if R = 1, Barrow’s Theorem XI 
is the Fundamental Theorem of Calculus in the following version: if F is a 
function such that 


F(x) = / "pdt, 
Vv 


then F’(x) = f(x). 
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Lecture XII includes several results on quadratures and rectification in- 
cluding one that is of particular interest because it may be the firs of its kind 
that can be expressed as integration by parts. It is a result that is part of a 
collection of propositions in Appendix I, which Barrow included “to please 
a friend who thinks them worth the trouble” [p. 110; 165].°° It begins with 
a general foreword, from which—as well as from Barrow’s figure—w make 
the following extract [pp. 110-111; 165-167]: 


Let ACB be a Quadrant of a circle, and let AH, BG be tangents to it; & in 
... AC produced take ... CE ... equal to the radius CA; ... and let the 
hyperbola LE O through E have asymptotes BC, BG. Also let an arbitrary 
point M be taken in the arc AB, and through it draw CMS (meeting the 
tangent AH in S) ... and MPL parallel to AC. 


After the general introduction, Barrow gave a collection of propositions, 
the fift of which begins as follows: 


56 This friend was John Collins, as he revealed himself in a letter to Oldenburg of 30 
September 1676. 
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V. Take CQ = CP; anddraw QO parallel to CE (meeting the hyperbola 
LEOinO)... 


With this preparatory work, we are ready for the proposition that concerns us 
[p. 112; 167]. 


VIII. Let the curve AYY be such that FY is equal to AS; draw next a 
straight line YJ parallel to AC, the space ACIY YA (that is, the swm of the 
Tangents belonging to the arc AM, & applied to the straight line AC, together 
with the rectangle FCIY) is equal to half the hyperbolic space PLOQ. 


Literally, this says that the sum of the area under an arc of the graph of a 
tangent function (the curve AYY, whether it looks it or not) and the area of 
a certain rectangle equals half the area under some arc of a hyperbola. This 
literal interpretation may not inspire rave reviews at firs reading, but a second 
examination using today’s tools of the trade will reveal something that may 
be considered curious and even interesting. 

For convenience, we take the radius of the quadrant AC B to be 1, denote the 
angle ACS by 60, choose C as the origin of rectangular coordinates, assume 
that CA is located on the positive x-axis (directed upward) and that CB is 
located on the positive y-axis (directed to the left). Then we have 


CF =cos6), CA=1, and FY = —tan 6. 
Now, if D is an arbitrary point on the segment AS, denote the angle ACD 


by 6. Then the curve AYY is the graph of y = —tan@, and the area of the 
space AFYYA is 


1 cos 69 
‘ [0 — (—tan@)|dx = -{ tan 6 dx. 
c 1 


OS 9 


But we are mixing our variables here. It should be graphically clear that 
x = cos 6, and then the last integral becomes 


00 
-f tan @ d(cos@). 
0 


Next, the area of the rectangle FC/Y is 


FYxFC = tan cos 0. 
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Finally, since CP = sin, then CQ = — sin 6, and if we take the equation 
of the hyperbola LEO to be 


then the area of the space PLO Q is 


sin 69 1 sin 09 1 
/ (0 — ) dy = / dy. 
— sin 4 ys 1 — sin 0 1- y 
But y = sin 0, and then the area of the space PLO Q can be expressed by the 
integral 


90 1 2 + sin é 
fom” dd= ——~— dé 
_@ 1—sin@ ae — sin? 6 
% gin @ 
= | 008 35 Phe + dé 
0 COsé 


The second integrand is odd and the integral vanishes. The firs integrand is 
even and the integral equals 


0 
2 / cos 6 d(tan@). 
0 
Therefore, Barrow’s result can be written as 


ay 60 
-{ tan 0 d(cos @) + tan 69 cos 4 -| cos 0 d(tan @), 
0 0 


and this is integration by parts. 

Barrow’s results are about tangents and quadratures. But the subject that 
he put together with such care and completeness was so heavily cloaked that 
major doses of skill and will would have been necessary to penetrate the 
geometric shields erected by the author. Barrow may have played the role 
of a moth attracted by the brilliance and elegance of the prestigious synthetic 
geometry of Euclid, only to get burned in the process. 

At this time Barrow left mathematics for good to devote himself to the 
study of theology, and it was as a Doctor of Divinity that he became Master 
of Trinity College in 1672. 
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5.7 NEWTON’S METHOD OF INFINITE SERIES 


During the plague years Newton worked extensively on what would later be 
called the calculus, summing up his discoveries in a manuscript that has been 
called the October 1666 Tract, a 30-page manuscript that bears this date.°” It 
was at this very early stage that he discovered the rules of differentiation and 
some methods of quadrature, at least for some particular kinds of what we 
now call functions. 

For instance, in a manuscript written about 1665 he found the subnormal 
at a point on a curve that we would express by the equation 


pty) 
St ’ 
qy) 


where p and g are polynomials in y. Newton called the subnormal v, and 
gave it in the form of a quotient whose numerator and denominator are to be 
computed according to the following rule [p. 323]:58 


Rule 2°. If y is in y® rationall denom: of x consisting of many termes 
[our g(y)], for the y° Numerat: in y° valor [value] of v multiply y by y° 
denom of x.squared. for y° denominat" mu[1]tiply y° Num: of x according to 
its dimensions [see below for an explanation of these words] & y° product 
by y° denom:. againe multip] y° Denom: according to its dim: & y° product 
by y° numer’ & substract y° less from y° greater & divide y° diff by y. 


This says that the numerator of the expression giving v should be y[q(y)]’. 
Next, the “Num: of x according to its dimensions” means that each term of 
p(y) must be multiplied by the corresponding exponent of y. For instance, if 
p(y) = 6y? — 5y°, to write it “according to its dimensions” means to write 
18y? — 25y>. In general, the “Num: of x according to its dimensions” is what 
we would write as yp'(y). Thus, Newton’s rule can be written in the form 


a ylayP 
yp'(y)a(y) — ya'(y) PY) 
y 


57 First published by Hall and Hall in Unpublished scientific papers of Sir Isaac Newton. 
A selection from the Portsmouth Collection in the University Library, Cambridge, 1962, 
pp. 15—64 = Whiteside, The mathematical papers of Isaac Newton, 1, 1967, pp. 400—448. 

58 The portion of the manuscript that contains the work quoted here is reproduced in 
Whiteside, The mathematical papers of Isaac Newton, I, pp. 322-341. Page references are 
to this edition. 
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First edition of Newton’s De analysi. 
Reproduced from the virtual exhibition E/ legado de las matemdticas: 
de Euclides a Newton, los genios a través de sus libros, Sevilla, 2000. 


It is not difficul to draw a figur ofa curve, its normal at a point, and a right 
triangle with legs v and y whose hypotenuse is an appropriate segment of the 
stated normal. Then, using current notation, it is easy tosee that v = y(dy/dx). 
It follows from the previous equation that 


dx 1 _ p'(yq) — 4’) PQ) 


dy dy — lam? 
dx 


This makes it clear that Newton had discovered a quotient rule for the subnor- 
mal in the case of polynomials at a very early stage in his work on what we 
now call the calculus. 

Later, after the appearance of Mercator’s Logarithmotechnia and the com- 
mentaries with improvements by Wallis and Gregory, and fearing that others 
“would fin out the rest before I could reach a ripe age for writing” (letter 
to Oldenburg of 24 October 1676) and would publish before him, Newton 
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quickly put down in writing all his discoveries on this subject in a manuscript 
entitled De analysi per equationes numero terminorum infinitas, probably his 
most famous mathematical work. It was circulated to a restricted circle in 
manuscript form in 1669, but remained unpublished until 1711. It may be 
fair to state that working in the setting of strict Euclidean geometry was not 
Newton’s cup of tea at this moment. He much preferred to obtain and express 
his results through the use of infinit series—power series to be specific—on 
term at a time. To deal with the general term, and referring to the figur on the 
firs page of the Latin edition, reproduced below (in which AB = x), Newton 
established the following Rule I (Regula I):°9 


an m+n 
x» = Area ABD. 
n 


Ifax™ = y, it shall be 
m 


He saved the demonstration for the end of the paper [pp.19—20; 243—245],© 
giving it with the aid of the next figure in which 


A B B 


the base AB = x, the perpendicular ordinate BD = y, & area ABD = z, as 
before. Similarly let BB = 0, BK = v, & the rectangle BBHK (ov) equal 
to the space BBSD. 

It is therefore AB = x +0, & AdB =z+ov. 


Then he made an assumption that would be crucial in his argument: 


59 This is an elaboration of Proposition 8 of the October 1666 Tract. See Whiteside, The 
mathematical papers of Isaac Newton, |, pp. 403—404. 

80 Page references given here are to Jones’ edition in Analysis per quantitatum series, 
fluxiones, ac differentias: cum enumeratione linearum tertii ordinis, 1711, and, after a 
semicolon, to Whiteside’s translation in The mathematical papers of Isaac Newton, II, 
1968. 
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If we now suppose Bf to be infinitel small, or o to be zero, v & y will be 
equal... 


Then he gave a small example before starting the proof proper in reverse, by 
taking as a hypothesis the conclusion in the statement of Rule I: 


Or in general if 


n m+n 
ax" =Z; 
m+n 
4 na 4 p A n 
or, putting —— =c&m-+n = p,ifcxn = z, or c"x? = 2”; then 


m+n 
substituting x + o for x & z+ ov (or, equivalently, z + oy) for z produces 


c" times x? + pox?"!, &c. =z" +noyz"!, &c, 
omitting of course the remaining terms which would ultimately vanish. 


That is, if x is replaced by x + 0, then the area under the curve increases 
from z = ABD to z+ ov (see figure) which is approximately equal to z + oy 
since o is very small. Thus, Newton replaced x with x + 0 and z with z + oy 
inc"x? = z", raised both sides to the nth power, expanded these powers, and 
announced his intention to omit the terms represented by “&c” because they 
contain negligible powers of o. Then he continued as follows: 


Now removing the equal terms c”x” & z”, and dividing the rest by o, there 
remains 


nyz" nyc" x? 
n p—1 n—-1 
c" px? =nyz (= ; i= P : 
CX 2 
Or, dividing by c”x?, it shall be 
= ny pelt) 
px = —>: or pcx 2 
Cxn 


= ny; 


na 
forc & m+n for p, that is, m for p —n & na for pc, it 


or, restoring 


m+n 
shall be ax = y. Conversely therefore, if ax 7 = y, it will make 
n m+n 
ax” =z. 
m+n 


Q.E.D. 


81 Tn the manuscript this statement reads “Si jam supponamus Bf esse infinit parvam, 
sive o esse nihil, erunt v & y zquales, ... ” in Whiteside, The mathematical papers of 
Isaac Newton, U, p. 242, but it is slightly different in Jones’ edition, p. 20. It is not correct 
to assume that Newton took over the o notation from Gregory, for he had already used it 
in his October 1666 tract on fluxions See Whiteside, The mathematical papers of Isaac 
Newton, I, pp. 414—415. Note, in particular, his statement: “And those terms are infinitel 
little in w" o is.” 
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If we were to interpret this in modern terms, we would write f(x) = 
c"x?P = z", which after incrementing becomes 


f(x to) = f(x) +noyz" | &e. 
Subtracting f(x) from both sides and dividing by 0, we obtain 


fe +o) f@) _ 


oO 


nyz" | &e, 


and omitting the terms represented by “&c” because they would ultimately 
vanish is equivalent to taking the limit as o — 0. In short, 


P 
z li f(x +0) — f(x) cxn p-l c P_4 m 
= im = Cc" px =—px"  =ax". 
y nz" 0-0 O nc"'xP , a 


Newton gave six examples of Rule I before stating Rule II [p. 2; 209], to 
the effect that if y is the sum of terms of the kind stated in Rule I, even if they 
are infinit in number, then the area is the sum of the areas corresponding to 
the individual terms. At that time, Newton must have thought that this was 
evident and in no need of proof. After several examples, he stated the third and 
last Rule [pp. 5; 211—213], basically asserting that if y “be more compounded 
than the foregoing [more than the sum of terms of the given kind], it is to be 
reduced to simpler terms.” He considered several possibilities and dealt with 
them by example. In this way, he found the area under y = aa/(b+x) by long 
division, obtaining an infinit series for the right-hand side and performing the 
quadrature term by term; for the curve y = ./aa + xx the binomial theorem 
gives a series expansion for the right-hand side; and in the more complex 


example 
V1 + ax? 
: V/1 — bx? 
both the binomial theorem and long division would bring home the bacon (the 
division is simpler if both numerator and denominator are multiplied firs by 
the denominator) [pp. 5—8; 213-219]. 

To pinpoint Newton’s contribution to the subject at this stage, it is clear 
that the method of using an infinitel small quantity o to perform what we 
recognize today as a differentiation was not new with Newton. In fact, we 
have seen the work of Fermat, which is quite similar. The sequence of steps 
by each of these two authors is as follows: Fermat—who used the letter F for 
the increment of the variable—adequates, removes common terms, removes 
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higher powers of E, divides by E, and gets the solution; Newton equates, 
removes higher powers of 0, removes equal terms, divides by o, and gets the 
solution. Gregory did much the same thing, other than adding the twist of 
introducing rectangles in his procedure. This priority was acknowledged by 
Newton, who admitted having obtained the idea for the method from Fermat 
and also mentioned the work of Gregory and Barrow.94 

Newton’s originality at this point consists in transforming a procedure 
known to work in isolated cases into a calculus of greater generality. While 
previous authors limited themselves to findin the tangent to one particular 
curve at a time, Newton was interested in a general method. And because of 
his use of infinit series, his method applies to functions with radicals, and 
that was new. Furthermore, a most important aspect of Newton’s originality 
consists in the interplay between what we now call differentiation and integra- 
tion, as we have seen in his proof of Rule I. Note how he found the function 
by differentiating the area under its graph from the origin to a variable point. 
That is, Newton was using what eventually became the fundamental theorem 
of calculus “and by applying it to abstract equations, directly and invertedly,” 
he made it widely applicable. Gregory’s and Barrow’s books also contain 
many results that, if properly interpreted, turn out to be theorems of analy- 
sis, including the fundamental theorem in both works (which Gregory applied 
“directly and invertedly” in the proof of his Proposition 54). But the heavy 
geometric fl vor of their presentation—almost incomprehensible today in Bar- 
row’s case—greatly limited the usefulness and applicability of their results. 
In Newton’s hands, this new subject became a practical calculus. 


5.8. NEWTON’S METHOD OF FLUXIONS 


Newton would reelaborate and amplify his view of the calculus in subsequent 
works, reinterpreting it firs as the fluxiona calculus, in a manuscript possibly 
entitled (the firs folium of the autograph is missing) Tractatus de methodis 


82 Tn the draft of a letter, now in the University Library, Cambridge (Add. 3968.30, 441’), 
Newton stated: “I had the hint of this method from Fermat’s way of drawing tangents and 
by applying it to abstract equations, directly and invertedly, I made it general. Mr. Gregory 
and Dr. Barrow used and improved the same method in drawing tangents.” See More, Isaac 
Newton. A biography, p. 185, n. 35. However, Whiteside, in The mathematical papers of 
Isaac Newton, I, p. 149, argues with good reason that “Newton had no direct knowledge of 
Fermat’s work but knew [only] the outlines of his tangent-method.” It is difficul to say, 
today, what Newton knew or did not know, but he claimed only to have had the hint of the 
method from Fermat and, as the French say, a bon entendeur, demi mot. 
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serierum et fluxionum (Treatise on the method of fluxion and infinit series). 
Newton tried to publish it beginning in 1671, but was unsuccessful. After a 
number of pages of preliminaries, in which Newton elaborated on such topics 
as long division of polynomials, he got around to considering equations in 
which some quantities are constant “‘to be looked on as known and determined 
and are designated by the initial letters a, b, c and so on,” [p. 73] 83 and others 
are variable: 


I will hereafter call them fluent and designate them by the fina letters v, x, 
y and z. And the speeds with which they each fl w and are increased by their 
generating motion (which I might readily call fluxion or simply speeds) I 
will designate by the letters 7, m, n and r. Namely, for the speed of the 
quantity v I shall put /, and so for the speeds of the other quantities I shall 
put m, n and r respectively. 


It would be difficul to remember that the speeds (fluxions at which v, x, y, 
and z fl w are designated by the letters /, m,n, and r. Newton would change 
the notation twenty years later when, in December 1691, he denoted those 
fluxion by 0, x, ¥, and z.©> This is particularly appealing if we think of the 
fluent as varying with time and of the dot as indicating derivative with respect 
to time, and we shall use this notation in our presentation even if it was 20 
years before its time in 1671. 

Then Newton posed the firs of twelve problems to be discussed in this 
new tract:®© Given the relation of the flowing quantities to one another, to 
determine the relation of the fluxions. Keep in mind while reading his method 
that it refers to an equation of the form now written as F(x, y) = 0 ora 
higher-dimensional equivalent [p. 75]: 


Arrange the equation by which the given relation is expressed according to 
the dimensions of some fl wing quantity, say x [that is, rearrange F(x, y) in 


63 Page references are to and translations from Whiteside, The mathematical papers of 
Isaac Newton, Ul, 1969, pp. 32-353. 

64 Three centuries before, Richard Suiseth had already used the words fluxus and fluens 
in this connection in his Liber calculationum, f. 9", col. 2 and f. 75”, col. 1. 

65 This notation appeared firs in print in the portion of a new tract, De quadratura cur- 
varum, that was included as an appendix to Wallis’ De algebra tractatus; historicus & 
practicus of 1693 (see below, pages 312 and 315, for details). Jones adopted these “dotted” 
fluxion in his 1711 printing of De quadratura curvarum, as did Colson in his translation of 
the fluxonia tract, so the practice of replacing Newton’s original fluxion with their dotted 
equivalents dates back to the early eighteenth century. 

86 Its notable length is due to the presentation of many examples, the subdivision of some 
of the problems into a number of subcases, and the consideration of related questions. 
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order of decreasing powers of x], and multiply its terms by any arithmetical 
progression [not any; he multiplied the terms by 0, 1, 2, 3,... from right 
to left; that is, he multiplied each by the exponent of x] and then by x/x. 
Carry out this operation separately for each of the fluen quantities and then 
put the sum of all the products equal to nothing, and you have the desired 
equation.®7 


To show how this is done he provided fi e examples, of which we select two. 
The firs and simplest is as follows: 


If the relation of the quantities x and y be x? —ax*+axy—y? = 0, I multiply 
the terms arranged firs according to x and then to y in this way. 


Multiply x? — ax? +.axy — y> | Mult. yi taxy — ax? +x3 
3x 2x x 3y } 

by ee, Oh by a 0 
x x x y y 

there comes 3xx* —2xax+xay x. | comes —3yy* + ayx *. 


And the sum of the products is 3x” — 2axx + axy —3yy? + ayx = 0, an 
equation which gives the relation between the fluxion x and y. Precisely, 
should you assume x arbitrarily the equation x? — ax? + axy — y? = 0 
will give y [not necessarily; it may not be solvable for y], and with these 
determined it will be 


x 3y? — ax 
~ 3x? — 2ax + ay 


This procedure resembles what we now call implicit differentiation. More 
complicated cases involving products, quotients, and roots can be solved by 
additionally using substitutions [p. 77]: 


b 3 
Example 4. Ifx?—ay?4 a, ay + x? expresses the relation between 
a+y 
x and y, I put z for by?/(a + y) and v for x*,/ay + x? and thence obtain the 
three equations 


x? —ay?+z-—v=0, az+yz—by? =0, & axty +x® —v? =0. 


87 This is an elaboration of Proposition 7 of the October 1666 Tract. See Whiteside, The 
mathematical papers of Isaac Newton, |, p. 402. 
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The firs gives 3xx? — 2ayy + 2-0 = 0,68 the second az + zy + yz 
— 3byy? = 0 and the third 4axx3y + 6%x° + ayx* — 2vv = 0 for the 
relations of the speeds v, x, y and z. 


After this, all he had to do was to solve the last equation for v, the preceding 
one for z, and take the results to 3xx* — 2ayy +z — 0 = 0. Then, if the values 
of z and v are restored, 


there comes the equation sought 
ae . Babyy? +2byy> = 4axxy + 6xx3 + ayx? 
3xx° — 2ayy4 = 0 


a +2ay + y? 2/ay + x2 


by which the relation of the speeds x and y is designated. 


Newton took this example and the idea of substitution from the October 1666 
Tract.©° This combination of the original method and substitution, which in 
this context is equivalent to an application of the chain rule, is quite power- 
ful, and permits what we now call implicit differentiation of very involved 
equations. 

After his fi e examples Newton provided a demonstration of his method. 
He chose the letter o to denote an infinitel small amount of time, in which x 
increases by xo and y increases by yo (Newton called these increments “mo- 
ments’’), and then he reasoned as follows [p. 80]: 


Given therefore any equation x? —ax*+axy—y? = 0, and substitute x +o 
for x and y + yo for y, and there will emerge 


x? + 3x0x? + 3x707x + x70 
— ax? — 2axox — ax’o’ 0 
taxy +axoy + ayox + axyo? ‘ 


a yey = eer ae 


68 As in the firs example; that is, 


xe—ay*>+z—0 —ay?+xe34+z-0 —v+x3 -—ay*+z zt+x3-ay*-—v 


3x 2y 

ys a "0 0 
x y v Zz 
3xx7 *. —2ayy? x. —vU x. ZO 


and so on for the others. 
89 Whiteside, The mathematical papers of Isaac Newton, I, pp. 411-412. 
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Now by hypothesis x? — ax? + axy — y? = 0, and when these terms are 
deleted and the rest divided by o there will remain 


5 9 5 5 5 5 
3xx7 + 3x70x + x30? — 2axx — ax?o0 + axy 


2 


3y°oy — yo? = 0. 


tayx +axyo—3yy 


And since above o is supposed to be infinitel small, so that it may express 
the moments of quantities, the terms that are multiplied by it will be worth 
nothing in respect to the others. I therefore reject them and there remains 
3xx?2 — 2axx + axy +ayx —3yy? =0, as in Example | above.” 


With this already settled (although present-day readers may have trou- 
ble accepting this as a demonstration), Newton was ready for Problem 3: To 
determine maxima and minima, which he solved as follows [p. 117]: 


When a quantity is greatest or least, at that moment its fl w neither increases 
nor decreases. For if it increases, that proves that it was less and will at once 
be greater than it now is, and conversely so if it decreases. Therefore seek 
its fluxio by Problem 1 and set it equal to nothing. 


EXAMPLE 1. If the greatest value of x in the equation 


ear bey yy =0 

be desired, seek the fluxion of the quantities x and y and there will come 
3xx? — 2axx + axy — 3yy* +ayx = 0. Then when x is set equal to zero, 
there will remain —3y* + ayx = 0 or 3y? = ax. With the help of this you 
might eliminate one or the other of x and y in the primary equation, and by 
the resulting equation determine the other, and then both by —3y? + ax = 0. 


Problem 4: To draw tangents to curves. This is more complicated because 
there was no single way of describing curves in Newton’s times. Or, as he 
put it, “Tangents are drawn in various ways according to the various relation- 
ships of curves to straight lines” [p. 121]. Thus, he considered nine modes 
and gave various examples of each. Mode | is presented by means of the 
following figure in which AB = x and the ordinate BD = y moves “through 
an indefinitel small space to the position bd...” [p. 123]. 


70 This is the same method already used in the October 1666 Tract to prove Proposition 7 
using the example x? — abx + a* — dyy = 0. See Whiteside, The mathematical papers of 
Isaac Newton, I, p. 414. Thus, we see that this tract contained the germ of Newton’s future 
work on the calculus. 


SECTION 5.8 NEWTON’S METHOD OF FLUXIONS 299 


Now let Dd be extended till it meets AB in 7: this will then touch the curve 
in D or d and the triangles dcD, DBT will be similar, so that 


TB De 

BD cd 
BD is known from the equation of the curve, and the fraction on the right is 
the ratio of the moment xo of x over the moment yo of y, which is equal to 
the ratio of their fluxions Then the preceding equation is solved for TB. As 


a firs example Newton returned to the curve x? — ax? + axy — y? = 0, for 
which he had already computed the fluxiona ratio 


x 35° — ax 
yp 3x2 — 2ax fay 


and then 
3y? — axy 


a= 
3x? — 2ax + ay 


Problems 5 and 6 of this tract on fluxion are devoted to curvature ques- 
tions, Problems 7 to 9 are on quadratures, and Problems 10 to 12 refer to 
rectification From all these we select two examples on quadratures. Newton 
began his discussion by referring to a figur like the one below, in which the 
perpendicular AC is 1, and denoted AB by x and the area ADB under the 
curve by z [pp. 195-197]. Ifthe segment BED moves a little to the right while 
remaining perpendicular to AB, the increments in the areas AB x BE = x and 
ABD = zareanarrow rectangle and a narrow quasi-rectangle with heights BE 
and BD. Therefore, these increments are (approximately) in the ratio BE/ BD. 
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A B 


Since the increments in x and z in a small amount of time are xo and zo, we 


obtain the basic formula 
BE 1 


BD BD 
“so that when we set x equal to unity, BD = z.” Thus, if BD is given by the 
equation of the curve we have z, and then it is possible, at least in principle, 
to fin z from z. 

Newton found an interesting use for this formula in Problem 8, where the 
following task is posed. Consider two curves FDH and GEJ, as shown in 
the next figure and establish the following notation: AB = x, BD = v, area 


Ne] ee 


D H 


A B A Cc 


AFDB =s, AC =z, CE = y, and area AGEC =t. If the curve FDH is 
given and a relationship between z and x is also given, fin a second curve 
GEI such that the areas s and ¢ are equal. 

To solve the problem we use the previous equation, which in the notation 
for the two new curves becomes 


Newton said [p. 199]: 


Hence if, as above, we assume x = 1, and v = §, it will be zy = ¢ and thence 
t 

== Vs 

Zz 
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A little while later he continued: 


By hypothesis therefore s = t and thence § = ft = v, and 


t v 
y = = = 
zZ z 
His firs example was the curve ax — xx = vv (from now on we write 


squares in the current fashion) and the relationship between the variables was 
given by ax = 2”. From Problem 1, a = 22z, and then 


[3 


2vz 
y= SS 
Zz a 


Since substituting x = z*/a in ax — x* = v? and simplifying gives 


we obtain 

“ /a@—2=y. 

a 

This is the “equation of the curve whose area equals the area of the circle.” 
It looks like the solution to a beautiful game, but there is a bit more to it 

than that. If those two curves enclose the same area, if A is the origin, and if 

we select b > 0, what we have in today’s notation is 


b Vab 9 2 
/ Vax — x? dx =f zo Be — 2 dz. 
0 0 


a 
More generally, if we write f(x) = Jax — x? and x = g(z) = z7/a, then, 
under appropriate hypotheses on g, 


b 


g l(b) 
: f(x) dx =| f (g(z))g' (z) dz. 


This is integration by substitution. 

As a firs example of Problem 9, To determine the area of any proposed 
curve, Newton considered the case of the hyperbola BD = x7/a, where a is a 
constant [p. 211]. Then, from the basic formula obtained before Problem 8, 
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“and there will (by Prob 2) emerge x7/3a = z.” For us this is just integration. 
For Newton, since Problem 2, From the relation of the fluxions determine the 
relation of the fluents, is the converse of Problem 1, “it ought to be resolved the 
contrary way” [p. 83], so let the following serve as an example of his contrary 
way: 


exe ea : 
I divide — [+ 0xx + 0x] | I divide z 
a 
. 3) = 
XxX Z Xx & 2 
by — making — by - making z 
X a Zz 
Then I divide by _ 3. 2. 1. | Then by 1. 
as 
making — making z. 
3a 


The total x7 /3a — z = 0 will be the desired relationship of x and z. 

This procedure requires what we now call indefinit integration, which is 
easy for polynomials. But Newton was well aware, way before we were, of 
the fact that this problem “cannot always be resolved by this practice” [p. 85]. 

After this, Newton assumed the validity of Rules | and 2 already presented 
in De analysi and relied on the method of infinit series to perform what we 
would call term-by-term integration in numerous examples. 

It should be said that our selections from the tract on series and fluxion do 
not do justice to the power of Newton’s methods. Newton’s paper contains a 
wealth of modes, cases, and examples that show the full range of his methods, 
plus an extensive table of areas (table of integrals) [pp. 236—241 and 244—255] 
in the part of the Tractatus devoted to quadratures. 

It is clear now that Newton’s work represents an essential change, having 
replaced the geometric approach of his predecessors with an algebraic one. 
This represents a totally new general approach and supersedes the considera- 
tion of particular cases. 


5.9 WAS NEWTON’S TANGENT METHOD ORIGINAL? 


When Newton provided his demonstration of the rule for “implicit differen- 
tiation” as applied to the equation x* — ax? + axy — y? = 0 (page 297) he 
did not give any credit to Barrow, but this is exactly the method of tangents 
that Barrow showed Newton “before he inserted it into his 10th Geometrical 
Lecture.” Referring back to this method and to the figur on page 281, if the 
coordinates of N are x and y, then the coordinates of M are x + e and y +a. 
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Since the point M is also on the given curve, substituting x + e for x and y+a 
for y, “there will emerge” the same equation that Newton obtained but with e 
in place of xo and a in place of yo. By Barrow’s Rule 1, all terms containing 
powers of a or e are to be omitted, which is equivalent to rejecting the terms 
that still contain o after division by o. By Barrow’s Rule 2, terms that do not 
contain a or e are to be rejected, which is equivalent to deleting the terms 
x? — ax? +axy— y>. Thus Barrow’s method and Newton’s basic method are 
one and the same. 

On this, Newton had the following comment in a letter to Collins of 10 
December 1672:"" 


I remember I once occasionally told Dr. Barrow when he was about to pub- 
lish his Lectures that I had such a method of drawing Tangents but some 
divertisement or other hindered me from describing it to him. 


But we don’t have to take this statement as proof that Newton already “had such 
a method of drawing Tangents” when he assisted Barrow in the publication of 
the Lectiones. In a manuscript written in September 1664,’ after moving all 
the terms of an equation “to one side soe y' it be = 0,” he wrote the following 
explanation of his method: 


Multiply each terme of y° equat: by so many units as x hath dimensions in y' 
terme [that is, multiply by the exponent of x], divide it by x & multiply it by y 
for a Numerator. Againe multiply each terme of y° equation by soe many 
units as y hath dimensions in each terme & divide it by —y for a denom: in 
the valor of v.73 


He gave three examples of this method, the third of which is this: 


4yx? — 2y>x + aayy 
= 7 


And if x+ — yyxx + aayx — y+ = 0. then 


4y3 — aax + 2yxx 


It is easy to carry out his instructions to obtain this result and to verify it using 
today’s implicit differentiation. 


7 Turnbull, The correspondence of Isaac Newton, 1, 1959, p. 248. 

72 Whiteside, The mathematical papers of Isaac Newton, I, pp. 236-238. 

73 By v, which Newton define via a figur that is unnecessary to us, he meant what we 
now would write as yy’. Thus, the last statement in the quotation means that the quotient 
so constructed equals yy’. 
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The idea of the method of tangents for what we call “implicit functions” 
had come to the mind of many in the second half of the seventeenth century. 
Barrow and Newton were not the firs nor would they be the last. The Dutch 
mathematician Johann Hudde had already described such a “General Rule” to 
Frans van Schooten, in a letter in Dutch of 21 November 1659:”4 


Arrange all the terms of the equation that expresses the nature of the curve, 
in such a manner that they are = 0 and remove from this equation all the 
fractions that have x or y in their denominators. Multiply the terms of the 
highest order in y by an arbitrarily chosen number, or even by 0, and multiply 
the term of the next highest order in y, by the same number minus one unit, and 
continue in the same manner with all other terms of the equation. Similarly 
multiply by an arbitrarily chosen number or by 0 the term of the highest order 
in x: the term of the next highest order in x, must be multiplied by the same 
number minus one unit, and the same with the others. When the firs of these 
products is divided by the second, the quotient multiplied by —x is AC. 


Hudde’s only example was this (the asterisk is just a place holder for a nonex- 
istent term in y): 


Let the equation that expresses the nature of the curve be 


x 
ar tay +h yr ay y? « +2x4- 4ab> =0 


2a 
1. Multip.by 1. +1. 0. 0. 0. -1.-2.  -2. 
2. Multip. by 0. +1. O. 42. +3. + 4. 0. 
1. Yields ay> + xy? — 4x4 + 8ab> 
3 2 
2. Yields +xy3 2x? y? * y? + 8x4 
2a 
3 Bh 4 4 8 b 
consequently, AC = ay AY ’ at : times — x./6 
3x" 4 
xy? — 2x2 y2 y + 8x 
2a 


74 Tt was later translated into French as “Extrait d’une lettre du feu M. Hudde a M. van 
Schooten, Professeur en Mathematiques a Leyde. Du 21. Novembre 1659,” 1713. My 
English translations are from Gerhardt’s reprint. 

75 As in Newton’s case, Hudde included a figure and the segment AC is what we would 
recognize today as y/y’. 

76 This is the result as given by Hudde, but the industrious reader who wants to use 
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It is easy to observe, and Hudde was quite clear on this point and naturally 
proud of it, that the progression of multipliers used in his method is completely 
general in the selection of the starting number (on the right of the steps labeled 
“Multip.” above he wrote “or by any other Arith. progr.”). This makes the 
method simple to use because one can arrange matters so that the multiplier 
of the power with the largest number of terms is zero: 


Which has been seen in the preceding example, in which 0 has been placed 
firs under y* and, second, under the terms that do not contain x. 


But it was René Francois Walther de Sluse (1622-1685), from Liege, who 
developed this method first according to the following statement by John 
Collins in a letter to Newton of 18 June 1673: ’” 


As to Slusius Method of Tangents consider this: it was well understood by 
him when he published his book de Mesolabio.’® But he did not want to 
make it public then because he was unwilling to prevent his Friend Angelo 
Riccio.’9 


By 1671 Sluse had been advised that Ricci was otherwise occupied and no 
further mathematical output was to be expected from him. Moreover, Sluse 
himself had “fallen upon a very easy method of proving things” regarding the 
method of tangents. Both of these statements are contained in a letter from 
Sluse to Oldenburg of 17 December 1671. He announced then his intention 
to publish it the next year, and in another letter of 17 January 1673 he sent it 


Newton’s method or today’s implicit differentiation to verify that they lead to the same 
conclusion may get in a spot of trouble. To avoid it, it must be observed that the equation 
of the curve gives 


3 
3 


x 
Sab" 4g" = Day" 4 2x5? 4 by Oxy go 
so that the last numerator becomes 


3 
x 

3ay> + 3xy° +2b°y* — 2x y? - —y3, 
a 


Then the three methods lead to the same result. 

77 Turnbull, The correspondence of Isaac Newton, I, p. 288. 

78 Mesolabum et problemata solida, 1659. 

79 Michelangelo Ricci was a friend and pupil of Torricelli, who published 19 pages of 
mathematics on maxima and minima as Michaelis Angeli Riccii geometrica exercitatio, 
1666. 
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to Oldenburg for submission to the Royal Society. In this letter Sluse, who 
wrote y for x and v for y, gave the following general rule: 


1. Drop from the equation those parts [terms], which do not contain either y 
or v, placing on one side all the terms containing y, & on the other those that 
contain v, with their signs + or —. For the sake of simplicity, we call this 
the right-, and that the left-hand side. 


2. On the right-hand side, set in front of each term the exponent to which v 
is raised, or, what is the same, extend the terms in that way. 


3. Do the same on the left-hand side, but preceding each term with the 
Exponent of the power of y. But & this is done throughout: One of the y’s 
in each term is changed into an a [the notation he used for the subtangent, or 


v/v’). 


On the next page Sluse gave several examples of the application of this 
rule, from which we select that of the curve y> + by* = 2qqv? — yyv*, which 
fully shows the procedure. Step 1 leads to the equation y> + byt + vyy = 
2qqv° — yyv>. Note that the term yyv? must be included in both sides because 
it contains both y and v. Multiplying each term by the exponent of y on the left- 
hand side and by the exponent of v on the right-hand side, the equation becomes 
Sy? + 4by* + 2u° yy = 6qqu> — 3yyv". Finally, changing one y in each term 
on the left-hand side into a we obtain Sy4a+4by*a+2v° ya = 6qqv? —3yyv’, 
and 

6qqu? — 3yyv? 
c= ; 
5y4 + 4by3 + 2v3y 
Sluse got this far, but if we remember that a = v/v’ we can write 


, Sy*+4by? + 2v7y 
v= . 
6q2v2 — 3y2v2 


The same result is obtained by any of the other methods. 

In conclusion, Barrow, Newton, Hudde, and Sluse independently devel- 
oped what is essentially the same method of tangents. But Newton’s method 
was the most powerful because, by his use of substitutions, he could deal with 
radicals. 


5.10 NEWTON’S FIRST AND LAST RATIOS 


Newton’s creativity with respect to the calculus had probably reached its peak 
with the preceding works, but he had published nothing. Then his attention 
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turned to matters of natural philosophy (which we call physics today), and in 
1687 he published his immortal book, the one that made Newton the figur 
that he is: Philosophie naturalis principia mathematica.®° 


|| PHILOSOPHIE 


NATURALIS 


PRINCIPIA 
||| MATHEMATICA. 


Antore FS. NEWTON, Trin. Coll. Cantab. Soc. Mathefeos 
re Peatine Lucafiano, & Societatis Regalis Sodali. 


| IMPRIMATUR- 
| S PEPYS, Reg. Sw. PRESSES 
Fulti 5. 1686. 


LONDINI, 
Jolla Societmis Regie ac Typis Fofephi Streator. Proftant Vena~ 
Jesaped Sau, Sovithad iliznia Principis Wallie in Cocmirerio 
D. Pani, aliofg; nonnullos Bibliopolas. AnnyMDCLXXXVIL 


Reproduced from the virtual exhibition El legado de las matemdticas: 
de Euclides a Newton, los genios a través de sus libros, Sevilla, 2000. 


While the main thrust of this book is about physical matters, it touches 
upon some of the concepts of the calculus, but in a very different language 
from the one he had used before. The firs publication of Newton’s calculus 
may have been a disappointment to those who already knew his work, if they 
understood at all that he was talking about that subject. There are, in his new 
presentation, no maxima and minima, no implicit differentiation, no infinit 
series, no rectification little of what his close colleagues had come to expect 
and admire. 


80 Translated into English by Andrew Motte as The Mathematical principles of natural 
philosophy, 1729. Page references are to this translation, of which there are recent reprints. 
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The new approach hits the reader with the force of unexplained language, 
for the title of Book I, Section I is [p. 41] Of the method of first and last ratio’s 
of quantities, by the help whereof we demonstrate the propositions that follow. 
Of these we present a selection. The firs one is preliminary and quite general 
[pp. 41-42]: 


Lemma I. 


Quantities, and the ratio’s of quantities, which in any finite time converge 
continually to equality, and before the end of that time approach nearer the 
one to the other than by any given difference, become ultimately equal. 


The second proposition is about quadratures and we include the proof 
[pp. 42-43]: 


LEMMA II. 


Ifin any figure AacE terminated by the right lines Aa, AE, and the curve acE, 


A BF CDE 


there be inscrib’d any number of parallelograms Ab, Bc, Cd, &c. compre- 
hended under equal bases AB, BC, CD, &c. and the sides Bb, Cc, Dd, &c. 
parallel to one side Aa of the figure; and the parallelograms aKbl, bLcm, 
cMdn, &c. are completed. Then if the breadth of those parallelograms be 
suppos’d to be diminished, and their number to be augmented in infinitum 
I say that the ultimate ratio’s which the inscrib’d figure AKbLcMdD, the 
circumscrib’d figure AalbmcndoE, and curvilinear figure Aabcd E, will have 
to one another, are ratio’s of equality. 


For the difference of the inscrib’d and circumscrib’d figure is the sum 


SECTION 5.10 NEWTON’S First AND LAsT Ratios 309 


of the parallelograms K/, Lm, Mn, Do, that is, (from the equality of all 
their bases) the rectangle under one of their bases Kb and the sum of their 
altitudes Aa, that is, the rectangle ABla. But this rectangle, because its 
breadth A B is suppos’d diminished in infinitum, becomes less than any given 
space. And therefore (by Lem. I) the figure inscribed and circumscribed be- 
come ultimately equal one to the other; and much more will the intermediate 
curvilinear figur be ultimately equal to either. Q.E.D. 


The rectangles in Lemma IJ are assumed to be “comprehended under equal 
bases,” but this restriction was lifted in the following [p. 43]: 


Lemma III. 


The same ultimate ratio’s are also ratio’s of equality, when the breadths AB, 
BC, CD, &c. of the parallelograms are unequal, and all are diminished, in 
infinitum 


For suppose AF equal to the greatest breadth, and compleat the par- 
allelogram FAaf. This parallelogram will be greater than the difference 
of the inscrib’d and circumscribed figures but, because its breadth AF is 
diminished in infinitum, it will become less than any given rectangle. Q.E.D. 


From this result, Newton was able to obtain four corollaries. 


Cor. I. Hence the ultimate sum of those evanescent parallelograms will 
in all parts coincide with the curvilinear figure 


Cor. 2. Much more will the rectilinear figure comprehended under the 
chords of the evanescent arcs ab, bc, cd, &c. ultimately coincide with the 
curvilinear figure 


Cor. 3. And also the circumscrib’d curvilinear figur comprehended un- 
der the tangents of the same arcs. 


Cor. 4. And therefore these ultimate figure (as to their perimeters ac E,) 
are not rectilinear, but curvilinear limits of rectilinear figures 


The remaining selections are on tangents [pp. 45—47] and use the following 
figure 5" 


81 Newton’s own figur contains additional letters, segments, and one arc that are of 
interest in the proof of Lemma VIII, a result not discussed here. 
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LEMMA VI. 


If any arc ACB given in position is subtended by its chord AB, and in any 
point A in the middle of the continued curvature, is touch’d by a right line AD, 
produced both ways; then if the points A and B approach one another and 
meet,°* I say the angle BAD, contained between the chord and the tangent, 
will be diminished in infinitum and ultimately will vanish. 


LEMMA VII. 


The same things being supposed; I say, that the ultimate ratio of the arc, chord, 
and tangent [the segment AD], any one to any other, is the ratio of equality. 


We could think ofa rectangular coordinate system in which the coordinates 
of A are (x, y) and those of B are (x + Ax, y + Ay). The slope of the chord 
AB is Ay/Ax and, since the ultimate ratio of the segments AB and AD is the 
ratio of equality, the slope of the tangent AD is the last ratio of the quantities 
Ay and Ax. 

At the end of Section I there is a Scholium in which Newton states why he 
chose to give geometric proofs, which was basically to conform to tradition 
and avoid criticism, although he did not put it that way. Then he anticipated 
an objection that undoubtedly would arise, as it did [pp. 54—55]: 


Perhaps it may be objected, that there is no ultimate proportion of evanescent 
quantities; because the proportion, before the quantities have vanished, is not 
the ultimate, and when they are vanished, is none. 


and explained that [p. 55] 


82 Imagine the segment RB D decreasing in length and approaching RA as R is kept fi ed 
and B and D approach A. In this approach, B remains on the fi ed arc AC. 
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by the ultimate ratio of evanescent quantities is to be understood the ratio of 
the quantities, not before they vanish, nor afterwards, but with which they 
vanish. 


To clinch it, near the end of the Scholium he added [p. 56] 


For those ultimate ratio’s with which quantities vanish, are not truly the 
ratio’s of ultimate quantities, but limits towards which the ratio’s of quantities, 
decreasing without limit, do always converge; ... 


No, he did not have a definitio of limit, but he clearly had the idea and his 
language in this statement is the one we use today. In this respect he was 
closer than his predecessors and contemporaries, and closer than most of his 
successors for over a century, to the modern definitio of derivative in terms 
of limits. 

Then in Section II of Book IH, On the motion of bodies, he inserted the 
following [vol. I, p. 17]: 


Lemma II. 


The moment of any Genitum is equal to the moments of each of the generat- 
ing sides drawn into the indices of the powers of those sides, and into their 
coefficients continually. 


This is far from clear. What is a Genitum, to start with? Newton explained 
[pp. 17-18]: 


Quantities of this kind are products, quotients, roots, rectangles, squares, 
cubes, square and cubic sides, and the like. 


Indeed, that is what the lemma proper does, to evaluate what we call the 
derivatives [moments] of products, powers, roots, quotients, and the like. 


Wherefore the sense of the Lemma is, that if the moments of any quantities 
A, B, C; &c. increasing or decreasing by a perpetual flux or the velocities 
of the mutations which are proportional to them, be called a, b, c, &c. the 
moment or mutation of the generated rectangle AB will beaB+ BA; ... 


n-m 


in general, that the moment of any power A ni will be “aA. 
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5.11 NEWTON’S LAST VERSION OF THE CALCULUS 


In 1691, about to turn 49, Newton was ready to collect his thoughts on the 
calculus and give the world a mature presentation of his work on this subject. 
In November he started writing a tract entitled De quadratura curvarum that he 
left unfinished In December he wrote and finishe a revised and augmented 
version, which he left untitled but is usually known by the same name as the 
unfinishe tract.®? It begins by recalling the 1676 letter to Leibniz about the 
binomial theorem, and just before stating his firs theorem he wrote [p. 51]: 


On this basis I have tried also to render the method of squaring curves simpler, 
and have attained certain general theorems. 


The firs of these is to perform the quadrature of the curve whose general 
ordinate is dz°(e + fz")* [p. 51], where z is the variable, d, e, and f are 
constants, and 6, 7, and 4 are just exponents. The general ordinate is firs 
expanded using the binomial theorem and then integrated term by term as in 
De analysi. In general, we should say that there is not much in the treatise 
that conceptually advances the calculus per se. In Proposition IV, Problem I, 
“Given an equation involving any number of quantities, to fin the fluxions 
and vice versa” he repeated the method already used in the Tractatus fluxionum, 
but there is a novelty: the use of “pricked” letters for the fluxion [p. 65]: 


Let a, b, c, d, e be determinate, unalterable quantities; v, x, y, z fluen 
quantities, that is indeterminate ones increasing or decreasing by a perpet- 
ual motion; v, x, y, z, their fluxions namely, their speeds of increasing or 
decreasing .. . 84 


It is Newton’s proficien y in performing quadratures that shows brilliantly 
in this tract. This is shown by the following imposing result, which will need 


83 Both of these versions, with English translations, were firs published by Whiteside in 
The mathematical papers of Isaac Newton, VII, 1976, pp. 24—48 and 48-129. Quotations 
are from and page references to this work. 

84 Tn some preparatory drafts for De quadratura, Newton played around with some con- 
densed notation for higher-order fluxions For instance, he wrote 


71 62 53 44 
2bzz — 12 — 30 — 20, 


where z with an integer n on top means the nth fluxio of z, and the last three terms are just 


short (excessively short) for — 12b2 z - 30b22 - 20b2 2, See Whiteside, The mathematical 
papers of Isaac Newton, VII, pp. 162—163. 
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some explanation.®® 


PROP. V. THEOR. III 


If the Curve’s abscissa AB be z, & for e+ fz" +27" +hz>" + &c. write R: also 
0 
let the Ordinate be z°-' R’—! x a+ bz" +.cz*" +. dz*" + &c. & put — = 1, 
1) 
r+rA=s,strA=tjt+rA=v, &e. 
1 


The Area will be = z° R* times + — 


re 
dp 
Prd fA 
r+l1xe 
ga ee, 
r+2xe 
lg—s4+2x fC—t+I1xgB-—vhA 
aye zn 
r+3xe 
il [eee oe eee ee, 
r+4xe 
+ &.%6 


Here A, B,C, D, &c. denote all the given coefficients of the separate terms 
in the series with their + & — signs, namely 
1 


<a 
A the coefficient ~ of the first term 


re 
Lp eA 

B the coefficient + of the second term 
r+1xe 


1 ae I 
-c—s+1x fB-—tgA 
C the coefficient + 


——— of the third term 
r+2xe 


And so on hereafter. 


85 | have reproduced this proposition and the example following it from pages 49 to 51 of 
Jones’ edition of De quadratura of 1711. See the bibliography for the full reference. 

86 There is a missing term in the coefficien of z*”—replaced here with [| }—in the 1711 
statement of the theorem. The reason is that Newton had originally stated this term to be 
re but then realized that the letter e, which is naturally expected after d, had already been 
used in the expression for R. For some reason, he deleted the originally written term rather 
than correct it, and it went to the printer in this manner. 
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We skip the demonstration that Newton provided. The explanation consists 
in definin firs the expressions 


R=et fz? + 92" +hz" + &e. 
and 

S=atbz" +27" + dz" + &e. 
(Newton had used the letter S for a similar purpose in the previous proposition, 
so we use it here), where 7 is just a constant used to construct the exponents. 


So are 6 and A, which in Newton’s examples were either integers or rational 
numbers. Then defin new numbers 


7) 
r=-, ss=rt+aA, t=staA, vs=tt+a, &e. 
U) 


With the notation so established, the proposition states that 


1 1 1 
aa —b—sfA c—(s+1)fB—tgA 
re (r+1)xe (r+ 2)e 


2n 


7a = +2) fC—C+ gk = vhA 
+ 


(432 ails ee. 


and this is an integration formula (not included in today’s calculus textbooks). 
Newton’s firs example, which is necessary for a full understanding of what 
is going on, was to perform the quadrature of 


3k — Izz 
zeal kz — Iz3 + mzt 


where k, /, and m are constants. Using present-day notation, this can be 
rewritten as 


it 3 1 
z (ke — I? + mz$) 23k — 12) = Bk 12 + mz)? "Bk — 12’). 
Then, in the notation of the proposition, 


a= 3k,b=0,c=-—l,e=k, f =0,g =—-lh=m,r4=3,n=1, 
0 


a pe, 8 a ee ee 
é-l= pO=—-ZHr,sa=—lt= 5,0 =0. 


Then A = —2, B = 0 because b = f = 0, C = 0 because c — tgA = 
—Il+ $(—1)(—2) = 0, and, in the same manner, 
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all the terms after the firs vanish to infinit & the area of the Curve comes 


forth 
/ k —Iz* +m23 
2 5 . 


&j 


After a few more equally impressive results on quadratures, Proposition XI 
is about recovering the fluent from their fluxions involving quite a number of 
cases and examples. Proposition XII and its corollaries give the Taylor series 
expansion, as already discussed in Chapter 4, and then Newton went on to 
solve problems on motion and centers of gravity. 

Although Newton took the almost completed manuscript of the quadratura 
on a three-week visit to London in the winter of 1691/1692, and there he 
expressed an interest in publishing it soon, the fact is that he lost interest in 
this matter, and the manuscript remained unpublished in this form until 1976, 
three centuries after he wrote the letter to Leibniz with which he began the 
tract. However, Newton sent two extracts from it (now lost) to Wallis in the 
summer of 1692, and Wallis published them in his very extended Latin version 
of his Treatise of algebra, both historical and practical of 1685.8’ This was 
the firs printed appearance of Newton’s pricked letters, a notation rapidly 
adopted by large sections of the mathematics world. 

But Wallis could not, during his lifetime, persuade Newton to publish the 
entire De quadratura tract. Instead, he wrote a severely truncated version 
of it in 1693 (without the Taylor or Maclaurin expansions), which, retitled 
The rational quadrature of curves, became Book 2 of his projected, but never 
published, treatise on Geometry. But this manuscript was not destined, as so 
many others, for oblivion. In 1703, Newton took it out of whatever drawer 
or shelf in which it was gathering dust, wrote an Introduction and a Scholium 
for it, and set out to publish it. Why? What was Newton’s motivation for 
such zeal in publishing one of his mathematical works while allowing others 
to remain dormant? Very simply, he had been stung by a plagiarist. Newton 
was about to publish his Opticks when the incident took place, and decided to 
append to it the abridged De quadratura as the second of Two treatises of the 
species and magnitude of curvilinear figures. The following is a part of what 
he had to say at the end of the “Advertisement” to the volume: ®° 


And some Years ago I lent out a Manuscript containing such Theorems, 
and having since met with some Things copied out of it, I have on this 


87 The extracts can be seen in Wallis, Opera mathematica, 2, 1693, pp. 390-396, and the 
pricked letters appear on page 392. See the bibliography for reprints. 
88 From Whiteside, The mathematical papers of Isaac Newton, VIL, 1981, p. 92. 
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Occasion made it publick, prefixin to it an Introduction and subjoyning a 
Scholium concerning the Method.8° And I have joined with it another small 
Tract concerning the Curvilinear Figures of the Second Kind, which was also 
written many Years ago... 


It is in the new Introduction that we fin Newton’s newest point of view 
on the calculus, starting with the assertion [p. 41; 123]: 


Mathematical Quantities I here consider not as consisting of least possible 
parts [“indivisibles” in an earlier draft], but as described by a continuous 
motion. 


Then he made a statement that may be viewed as a glimpse into the concept 
of limit [pp. 42; 123-125]: 


Fluxions are very closely near as the augments [increments] of their Fluents 
begotten in the very smallest equal particles of time, &, to speak accurately, 
they are in the firs ratio of the nascent augments ... 


which, if the fluent are denoted by x and y, we could explain by writing 


Newton explained it as follows, referring to the next figur (we have inserted 
arc symbols in the text for clarity) [p. 42; 125]: 


Let the ordinate BC advance from its place BC into a new place bc. Complete 
the parallelogram BC Eb, and draw the tangent line VTH to touch the Curve 
at C and meet bc & BA extended in T & V : & the augments of the Abscissa 
AB, Ordinate BC & Curved Line ACc now begotten will be Bb, Ec & Cc : 
& in the firs ratio of these nascent augments are the sides of the triangle 
CET, consequently the fluxion of AB, BC & AC are as the sides CE , ET 
& CT of that triangle CET, & can be expressed by means of the same sides, 
or what is the same by the sides of the triangle VBC similar to it. 


It comes to the same if the fluxion be taken in the last ratio of the vanishing 
parts. Draw the straight line Cc & produce it to K. Let the Ordinate bc 


89 The new introduction and scholium can be seen in Whiteside The mathematical papers 
of Isaac Newton, VIL, pp. 123-159, while the revisions to the text proper are in pp. 92-105. 
Double page references in the rest of this section are to Jones’ Latin edition first from which 
the quotations are taken, and then to Whiteside’s translation. 
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go back into its former place BC, & as the points C & c come together, 
the straight line CK will coincide with the tangent CH, & so the vanishing 
triangle CEc will as it attains its last form end up akin to the triangle CET, 
& its vanishing sides CE, Ec & Ce will ultimately be to one another as are 
the sides CE, ET & CT of the other triangle CET, & in this proportion in 
consequence are the fluxion of the lines AB, BC & AC. If the points C & 
c are at any small distance apart from each other the straight line CK will be 
a small distance away from the tangent CH. In order that the line CK shall 
coincide with the tangent CH & so the last ratios of the lines CE, Ec & Cc 
be discovered, the points C & c must come together & entirely coincide. The 
most minute errors are not in Mathematical matters to be scorned. 


In this we can see the evolution of Newton’s thought regarding the problem 
of tangents. In De analysi, at the age of 26, he did not hesitate to accept 
infinitel small quantities, while in the published version of De quadratura, 
at 61, he had come to realize that they are not acceptable. Since the writing of 
the Principia he used the notion of firs and last ratios instead of infinitesimals 
This new concept is not properly defined and, of course, the finit ratios (for 
instance, Ec/CE) do not approach an ultimate ratio, but just a number. He 
knew that, as we have seen above. 

It will be interesting to show how he dealt in De quadratura with a case 
already presented long ago in De analysi, that of a simple power [pp. 43—44; 
127-129]: 


Let the quantity x flow uniformly & the fluxion of the quantity x" need to 
be found. In the time that the quantity x comes in its flu to be x + 0, the 
quantity x” will come to be (x +0)", that is [when expanded] by the method 

nn—n 


ofinfinit series, x”-++-nox"—!+ oox"-?+ &c. Andso the augments 0 
g 
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& 

nox" | 4 => a oox" > + &c 
are one to the other as 1 & nx""! 4 — "ox"? 4+ &c. Now let those 
augments come to vanish, & their last ratio will be 1 to nx"~! : consequently 


the fluxio of the quantity x is to the fluxio of the quantity x” as 1 to nx”~!. 
In the next paragraph he added [p. 44; 129]: 


By similar arguments there can by means of the method of firs & last ratios 
be gathered the fluxion of lines whether straight or curved in any cases 
whatever, ... & to investigate the firs and last ratios of nascent or vanishing 
finites is in harmony with the Geometry of the Ancients : & I wanted to show 
that in the Method of Fluxions there should be no need to introduce infinitel 
small figure [quantities] into Geometry. 


5.12. LEIBNIZ’ CALCULUS: 1673-1675 


Starting in 1673, Leibniz investigated problems on tangents and quadratures in 
some manuscript papers that remained unpublished for almost two centuries. 
They are in the nature of notes for his own use, possibly made as he read the 
works of Descartes, Sluse, St. Vincent, Barrow, and James Gregory among 
others. In this way he developed his own ideas, discovered some interesting 
formulas, and introduced his very famous notations d and |. These appeared 
firs in a manuscript of 29 October 1675, which is part of a collection of three 
manuscripts under the common title Analysis tetragonistica ex centrobarycis 
(Analysis of quadratures by means of centers of gravity).9° These manuscripts 


90 Copies of Leibniz’ mathematical manuscripts are available from the Gottfried Wilhelm 
Leibniz Bibliothek / Niedersdchsische Landesbibliothek (Lower Saxony State Library), Han- 
nover. A complete catalog (not completely up to date) can be found in Bodemann, Die 
Leibniz-Handschriften der Kéniglichen Offentlichen Bibliothek zu Hannover, 1895, and 
the bibliography contains references to this catalog for Leibniz’ manuscripts. Many of 
these manuscripts have been subsequently printed in Die Entdeckung der differentialrech- 
nung durch Leibniz, 1848; Die Geschichte der héheren Analysis, erste Abtheilung, Die 
Entdeckung der héheren Analysis, 1855; Der Briefwechsel von Gottfried Wilhelm Leibniz 
mit Mathematikern, 1899 (all edited by Gerhardt); and in Sdimtliche Schriften und Briefe, 
Ser. VIL, 5 (edited by Probst, Mayer and Sefrin-Weis). English translations of many of these 
manuscripts can be found in Child, The early mathematical manuscripts of Leibniz; trans- 
lated from the Latin texts published by Carl Immanuel Gerhardt with critical and historical 
notes, 1920. References given in the text for Leibniz’ manuscripts are to the manuscript 
folio first and then by page to the printed versions listed in the bibliography, in the order 
listed there. 
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GOTTFRIED WILHELM LEIBNIZ 


Stipple engraving by Benjamin Holl, 1834. 
Lithograph from the author’s personal collection. 


were not meant for publication, and exposing their contents to the public view 
may fall under the heading of indiscretion, to say the least. Nevertheless, 
curiosity killed the cat. 

The firs of these manuscripts is dated October 25 and 26, and in the second 
part we fin asignifican result. Before presenting it, it is convenient to explain 
some of Leibniz’ notation at that time. First, he used the symbol N as an equal 
sign and then he chose the abbreviation omn. of the Latin word omnia [omnes 
in Leibniz’ usage] to mean “all,” although we may prefer to read it as “the sum 
ofall.” Thus, if @ is a variable that assumes several values in a certain context, 
he denoted the sum of all the values of w’s by omn. w. When summing terms 
that consist of more than one letter, he found it convenient to cover them with 
an overline to denote the extent of the terms to be added, as in omn. x@ to sum 
the products xw. Finally, he used the abbreviation ult., from the Latin ultima, 
to mean “last.” 

It must be said, before quoting from his work, that Leibniz managed to 
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construct a practical calculus that could solve real problems, but was never 
a model of clarity. Even his contemporaries, more used to the language of 
his time than we could ever be, found it heavy going to read and understand 
Leibniz. Nevertheless, at the risk of doing injustice to the author, we must 
plunge in. The reader has been forewarned. 

At a certain point in the manuscript of October 26 he changed the topic he 
was discussing and inserted the following [f. 1”; pp. 120-121; 150—151; 268; 
70-71]: 


Another [thing]: The moments of the differences about a [straight line] per- 
pendicular to the axis, are equal to the complement of the sum of the terms 


omn. @ w@ 


or: The Moments of the Terms are equal to the complement of the sum of 
the sums. 


Or 


omn xo M1 ult.x, Omn. &,, — omn. omn. © 


az, 
Let xw/N az. becomes: w MN —. it makes 
Xx 


_ az az 
omn. az 71 ult.x omn — — omn. omn — 
x x 


Therefore, 


And 


a a 
omn am ultx- omn — — omn. omn. — 
x x 


Which last theorem exhibits the sum of logarithms in terms of the given 
quadrature of the Hyperbola.°" 


°1T am trying to reproduce the manuscript formulas as closely as possible, including 
Leibniz’ punctuation (some of his commas look like periods) or lack thereof and his switch 
from omn. to omn without a period. However, the copies that I have used are not totally 
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To explain this result we can start by drawing a more detailed figure repre- 
senting the series of ordinates y (located horizontally) of a curve OC, and 


O D 


B Cc 


Leibniz assumed that the distance between any two consecutive abscissas is 
an infinitesimall small unit, which we shall denote by z. The reader who 
wonders how this distance can be a unit and at the same time infinitesimall 

small is not alone. Referring then to the figure in which the difference of 
any two successive ordinates has been labeled w, if x denotes the abscissa 
corresponding to an arbitrary ordinate y, then the area of the curvilinear figur 

OCD is the sum of all the products xw (which Leibniz called moments by 
analogy with physics, in which a moment is the product of a weight times 
the distance to the axis; the roles played here by w and x), that is, omn. x0, 
giving the left-hand side of his firs equation. But this area is that of the 
rectangle OBCD minus that of the curvilinear figur OCB. The area of the 


clear (see the sample reproduced on page 326) and I cannot make impossible claims about 
my readings. The reader should be aware of the fact that the previously printed versions of 
Leibniz’ formulas do not always coincide with the manuscript forms. For example, the last 
equation appears as 


a a 
omn. am ult.x omn. — — omn. omn. — 
x x 


in Die Geschichte, as 


a a 
omn. a 7 ult.x. omn. — — omn. omn. — 
x x 
in Der Briefwechsel, and as 
a a 
omn. aM ult. x omn. — — omn. omn. — 
x x 


in Child’s translation. The differences shown here may not be very significant but it gets 
worse, and there are instances in which Leibniz’ meaning has been completely altered. 
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rectangle is the product of the last x and the segment BC, that is, the product 
ult. x omn. @. The area of the curvilinear figur OBC is the sum of the areas 
of the rectangles formed by the unit and the ordinates, and each ordinate is 
the sum of all the w’s up to that particular spot, or omn. w, and then the area 
of OBC is omn. zomn. o. In conclusion, 


omn. X@ 7 ult. x Omm.@ — omn. Zomn. o, 

where we have eliminated the unnecessary commas of the original.9° Leibniz 
did not write the unnecessary unit z. This result will be interpreted below in 
today’s terminology. 

Since the curve in this discussion is completely general, we can consider 
particular cases. For instance, if xw = az, where a is a constant and z = 1, 
we obtain 

az az 
omn. az 7 ult.x omn. = omn. omn. = 

Of course, x should not come near zero in this equation, and we can assume 
that the abscissa of the ordinate OD is x = 1 in this case. Note then that 
omn. az/x is the sum of the areas of the rectangles formed by the unit z and 
the ordinates a/x, and therefore equals the area under the hyperbola y = a/x 
starting at x = 1, which is a logarithm. Then 

az 


omn. omn. — 
xX 


is a sum of logarithms. Thus, Leibniz has obtained “a sum of logarithms in 
terms of the known quadrature of the hyperbola.” 

On 29 October 1675, in Analyseos tetragonistice pars 2da. (Second part 
of analytical quadrature), Leibniz considered a problem in inverse tangents 
that shows the use of the so-called characteristic triangle, shown in the next 
figur as GWL (perhaps Leibniz was so proud of this feature that he labeled 


82 Leibniz has used omn. w with two meanings on this line. First it means the last y, to be 
multiplied by ult. x, and then it denotes an arbitrary y to be multiplied by z and added up. 

93 They had a meaning for Leibniz: ult. x was to be multiplied by everything between 
the next two commas, then everything after the third comma was to be subtracted from this 
product. 

94 For those who are impatient or prefer to figur it out by themselves, this is a formula 
for integration by parts. 

°5 Now Leibniz wrote the unnecessary unit z. Since the left-hand side of the equation is 
an area, the right-hand side must also be an area, and, to preserve dimensionality in those 
geometry-dominated times, Leibniz inserted a length z = 1. It is surprising that he did not 
do so in his firs result, where it would have served the same purpose. 
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P 


the triangle with his own initials). Here TL is the tangent to the curve AL 
at L, PL is the normal at L, / takes the place of w in the previous figur and 
represents the difference of consecutive ordinates, while a replaces z as the 
infinitesimall small unit that separates these ordinates, so we shall be able to 
take a = 1. Leibniz did not place any of these lowercase letters in his figur 
(while he drew some additional segments that are not shown here because they 
are not needed in our presentation), but established the notation in this manner 
[f. 2"; p. 123; 153; 290; 79]: 


BLoy. WLnI. BPNp. TBnt. ABNx. GWna. ynomn. I. 


Then he continued as follows (except that we display equations on their own 
lines as is the habit today) [ff. 2’—-2°; p. 124; 154; 291-292; 80]: 


Furthermore 
= P . 
a omn./ny 
Therefore 
omn / 
Pp Mn 
Therefore 
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but I have elsewhere proved that 


2 


th) 


96 


omn / 
> oF eS 
2 2 


Therefore we have a theorem that I regard as admirable, and [that will be] 
of great service to this new calculus in the future (Ergo habemus theorema 
quod mihi videtur admirabile, et novo huic calculo magni adjumenti loco 
futurum), namely 


omn. pn. 


omn / 


bh] 


Momn omn / —: 
a 


whatever / may be. That is if all the 7, are multiplied by the last one, and 
another [set of] all /, in turn by their last, and so on as often as it can be donef[, | 
the sum of all these [products] will be equal to half the sums of the squares, 
whose sides are the sums of the /, or all the /.9” this is a very beautiful and 
not at all obvious theorem. Likewise so is the Theorem: 


omn x/ Mx omn/ — omn omn/.%8 


putting ... 


at which point he explained the notation in the last equation. 
We shall interpret the meaning of these results shortly. But firs we shall 
establish the left-hand side in the admirable theorem by evaluating 


yl 1 _ 
omn. p = omn. — = —omn. yl. 
a a 
Leibniz would provide a geometric explanation later in his life, when he wrote 
a recollection of his discoveries on this subject as Historia et origo calculi 
differentialis in 1714 [pp. 9; 400; 40—41; 11]. Referring to a figur sharing 
many common elements with the previous one, he stated: 


°6 The boxed 2 in this equation is an exponent, and the infinitesima unit a need not be 
written explicitly. I do not know whether Leibniz actually proved this formula in a previous 
manuscript, although he made the statement several times, for instance, in the manuscript 
De triangulo curvarum characteristico of January 1675. On the basis of a figur like the 
one included here, but writing D instead of B and M instead of P, Leibniz obtained the 
firs equation in this quotation and then simply stated [f. 1"; p. 184] “Ergo summa omnium 
reductarum ipsius DL semiquadrato;” that is, that the sum of all the p’s is 5 y?. 

97 | eibniz’ use of plurals is confusing. This means “equal to half the square whose side 
is the sum of the /’s, or all the /’s.” 

98 This is his firs theorem in the previous quotation, but replacing w with / according to 
the new notation. Notice that Leibniz has deleted the abbreviation ult., thereby adding to 
the confusion of using omn / with two different meanings. 
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But straight Lines which increase from nothing [the y’s of an increasing 
function that passes through the origin] each multiplied by its corresponding 
Element [the / for each y] make a triangle. 


This triangle is shown in the following figure On the vertical axis we have 
placed the successive increments /, and on the horizontal each ordinate y is 


~ SS MS SN 


the sum of all the preceding /’s: y = omn./. The sum of the infinitesima 
rectangles y/ is approximately equal to that of the triangle, which, if we are 
given a “last” y, is y?/2. That is, 


y? _ (omn. 1)? 


omn. yl = = ; 


In Leibniz’ notation, and in view of the fact that pM yl/a, 


omn. / 2] 


omn. pl 5) 
a 


and we see that Leibniz had simply dropped the unit a to write the left-hand 
side of his admirable theorem. 

At this point in the Analysis tetragonistica Leibniz decided to improve his 
notation, stating: “It will be useful to write /. for omn. so that f/ [stands] for 
omn. /. that is the sum of all the /” [f. 2”; p. 125; 154; 292; 80]. He chose the 
symbol / as an elongated s that was used at that time, and it was meant as 
the initial of the Latin word “summa” for sum. With this new symbol /', the 
admirable theorem becomes 


326 THE CALCULUS CHAPTER 5 


as 


erie oa 4 eek hace 
he oath ri Cities 
. bbe cl? a 


The portion of folio 2” of the “Analysis Tetragonistica” manuscript containing Leibniz’ 
famous 4 and d notations (upper left on the second line from the top, 
and lower right on the third line from the bottom). 
Courtesy of the Gottfried Wilhelm Leibniz Bibliothek, Hanover. 


(note that Leibniz reinserted the unit a), and the equation 


omn x/ M x omn/ — omn omn / 
becomes 


fal xft — ffl. 


They appear in the manuscript in the following form, and these are not the 
2 _ 
Ay SIL + Fg xft sf. 


same as some of the formulas made available later in printed form. 
These equations are still foreign-looking today, but then Leibniz introduced 
the letter d to help rewrite each difference / of y’s in a more palatable way. 


99 


99 These equations have been electronically cleaned from a scan of the image at the top 
of this page in order to make them more readable, at the risk of throwing away the baby 
with the bath water. There is an old spill on this page which makes it difficul to read the 
equation on the right. The printed versions appear in Gerhardt’s Die Geschichte as [p. 125] 


Wo fir et fans fr- ffi 
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However, his firs attempt may be shocking. In his own words, he introduced 
this d operator as follows [f. 2”; pp. 126; 155; 293-294; 82] (once again, most 
of his commas look like periods): 


Given /. [its] relation to x. search for f 1. Which is done now from the 
contrary calculus namely if {7M ya, we put/n ~. Then as /'. will increase 
so d. will diminish the dimensions. But Ae . Means a sum, d. a difference: 


a 
From the given y we can fin a or /. or the difference of the y. 


Not very clear and not what we might have expected. Even accepting that a is 
the infinitesima unit, what is it doing here, what is this talk about dimensions, 
and why is d in the denominator? The equation {1 = ya and the statement 
““f will increase the dimensions” suggest that Leibniz was thinking of [J as 
an area, '°° and then it cannot equal the length y. It is necessary to multiply y 
by some other length a (which we must take to be unity) to make the equation 
homogeneous. The same kind of argument justifie that ya/d is a length. 
However, homogeneity cannot be a sufficien explanation for Leibniz’ de- 
cision to put d in the denominator, especially in conjunction with his statement 
that d means a difference. What difference? What does it mean to say that 
ya/d is l, the difference of the y’s? The differences between consecutive y’s 
could very well be all different, so which is the one that d produces from 
their sum f{/? A simpler explanation, which comes with a clarification can 
be sought in the figur on page 323. The similarity of the triangles TBL and 


in Der Briefwechsel as [p. 154] 


Ds offre d at sft f FF, 


and in Child’s translation in the form [p. 80] 


Pe = frit and fat=xft— f fi 


The firs left-hand side is now meaningless: a sum of squares has replaced what previously 
was the square of a sum. Whether the changes from manuscript to printed form to translation 
are due to transcriber and translator or to problems at the printer’s, this sample should give 
an idea of some of the difficultie faced by a modern reader trying to understand Leibniz 
from Child’s translation. However, Child pointed out on several occasions that he had been 
unable to see the original manuscripts, and could only translate from Die Geschichte. 

100 Recall that in the firs quotation in this section Leibniz had used the notation omn. w 
with two meanings, and one of them was as a sum of rectangular areas. 


328 THE CALCULUS CHAPTER 5 


GWL immediately gives 
ya 
tf 
This is what Leibniz might have had in mind, and this is a real quotient. And 
now we know which / is produced by this process: the one corresponding to 
the particular subtangent ¢ at a given y. But/ is a difference of y’s and Leibniz 
may have wanted to emphasize this point by the use of the letter d instead of t. 
However, this is reading Leibniz’ mind in retrospect. If we just read his 
words, he did not even make it easy for us to learn by example, for this is what 
he had to say in that respect [f. 2”; p. 126; 155; 294; 82]: 


l 


Hence one equation may be transformed into the other, just as from the 
equation: 


we can obtain: 


What we get from this, if we are not too inquisitive, is that / and d are inverse 
operators. What | does, d (in the denominator) undoes. If we also want to 
know what the equations mean, note that fl = y, and then, discarding the 
common constant c, recalling that a = 1, and assuming that fi : means (/1)?, 


the second equation reads 


2 y 


Thus, 1/d does here what the derivative does today. The firs equation is a 
little more problematic and needs interpretation. If Leibniz meant 


y 


= 
2 cfl 
fer ae 


then, discarding the c and recalling that a = 1, this is equivalent to 


101 This is my reading of the equations in the manuscript. Both Gerhardt and Child have 


/ /? on their left-hand sides. Child also has f B on both right-hand sides, as Gerhardt has 
in Die Geschichte. 
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which we regard as correct. Of course Leibniz’ notation at this point is hor- 
rendous and inconvenient, and we are all happy that it did not last long. On 
November 11, in a new manuscript entitled Methodi tangentium inverse ex- 
empla (Examples of the inverse method of tangents), he carried on with the d 
in the denominator for a while and then, without explanation or transition he 
wrote (this is out of context and its precise meaning is not in question now) 
[f. 1"; p. 34; 134; 162; 323-324; 95] 


N 


2 2 
Stte floor «eee 

y y 
From this moment on, after a little wavering, the d is always upstairs. Accept- 
ing this new position of d, as we must, we are now ready to rewrite some of 
his previous equations in this new notation. From {7 = y we obtain / = dy. 
Then Leibniz’ admirable result, which previously interpreted with the sym- 
bol { (page 325) and putting a = 1 was 


fe nffil, 


2 
y 
—_ = dy. 
2 fro 


Not much to write home about, since much more than this had already been 


now becomes 


obtained by Fermat and Gregory. Also, the equation f'x/n “it — f fl becomes 


xavany- fy ax, 


but where did the dx come from? It is just the infinitesima unit separating 
ordinates, so inserting it does no harm. 

Eventually, Leibniz would go on to develop the rules of the operator d, but 
on November 11, 1675, he just toyed with the idea that d(xy) and d(x/y) might 
be equal to dx dy and dx/dy [f. 2", p. 38; 137; 165; 328; 100], respectively, 
but discarded it. Keep in mind, before passing judgment, that Leibniz never 
meant any of these musings to be published. He was just making up his own 
mind about these matters. 


5.13, LEIBNIZ’ CALCULUS: 1676-1680 


Very little happened in Leibniz’ work that was of any consequence to the 
calculus for one year, except that he kept using his new notation and getting 
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enough consistency in writing the necessary differentials in all his “integrals.” 
But then, already on his way back to Germany, he stopped in London for about 
a week in October 1676. He probably wanted to check on what the English 
had been up to, either because he had heard something about it or out of simple 
curiosity. 

He did fin something and put it to good use. Collins was very open with 
him and showed him a copy of Newton’s De analysi, of which Leibniz made 
an ample extract.'°* But it is clear from what he copied and what he did not 
that this did not advance his own research in any manner. He also made an 
extract of a letter of 10 December 1672 from Newton to Collins. The extract 
is as follows: 1°3 


Letter of Newton, 1672: ABC is any angle, ABN x, BC Ty. Take for 


Cc 
D A B 
example, the equation 
0 l 0 0 2 3 
x3 — 2x?y + bx? — b’x — by? — y? n0. 
3 2 2 1 0 0 


Multiply the equation by an arithmetical progression, both for the second 
dimension y and for x; the firs product will be the numerator, and the other 
divided by x will be the denominator of a fraction which will express BD, 
thus, 

—2x?y + 2by? — 3y3 

3x2 — Axy + 2bx — Bb? 

Moreover, that this is only a corollary or a case of a general method for both 
mechanical and geometrical lines, whether the curve is referred to a single 
straight line, or to another curve, without the trouble of calculation, and other 
abstruse problems about curves, etc. This method differs from that of Hudde 
and also from that of Sluse, in that it is not necessary to eliminate irrationals. 


BDn 


102 Tt can be seen in its entirety, in Latin only, in Whiteside, The mathematical papers of 
Isaac Newton, I, pp. 248-259. 

103 Tf appears in Gerhardt, “Leibniz in London” (1891) 257-176. English translation in 
Child, The early mathematical manuscripts of Leibniz. This quotation is from page 194 of 
the translation. 
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Or, as Newton himself put it 


Suppose CB applyed to AB in any given angle be terminated at any Curve line 
AC, and calling AB x & BC y let the relation between x & y be expressed 
by any zequation as 


x3 —2x’y + bx? —b’x — by? — yy =0 


whereby the curve is determined. To draw the tangent CD the Rule is this. 


Here he repeated the procedure already outlined in Section 5.8 to solve Prob- 
lem 4 and obtained BD as copied by Leibniz. He also stated that his method 
is not “limited to equations w are free from surd [irrational] quantities.” 

In short, what Leibniz got out of this is that Newton had a method (the 
one shown in Example 4 on page 296) to fin tangents that was similar to 
that of Sluse but in which it was not necessary to eliminate irrationals. Leib- 
niz must have worked quickly to justify this rule using his new differentials 
because he included such a justificatio ina manuscript entitled Calculus tan- 
gentium differentialis, adjecta sub finem machina construendi equationes per 
logarithmicam of November 1676. Here we fin firs some rules for findin 
differences and sums [f. 3”; pp. 56; 140; 229—230; 614-615; 124]: 


dxnl dx2 n2x dx3 13x? &c 
to 2. 2 T_ 3 tee 
om e a" a & aa N-3 &C 


—_ 1 
dJ/xn ae &oi06 
From these the following general rules may be derived for the differences 
and sums of simple powers 
etl 


e+1 


dx®ne,x*! and conversely xen 


104 Turnbull, The correspondence of Isaac Newton, I, pp 247-248. 

105 Gerhardt corrected the last power of x to x* in Der Briefwechsel, as did the Séimtliche 
(except for full disclosure in a footnote), but that is not what Leibniz wrote, and both 
Gerhardt, in Die Entdeckung and in Die Geschichte, and Child wrote the right-hand side in 
the second equation on this line as —2/x*. The manuscript has -}+ in which the exponent 
of x looks like a broken 3. I have read it as a3, as did Gerhardt in Der Briefwechsel and the 
Sdmtliche. 

106 Gerhardt, in Der Briefwechsel, corrected the right-hand side of this equation to 1/2./x. 
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Needless to say, the e used here stands for an arbitrary exponent and not for 
Euler’s number e. This gives the derivative and integral of a power of x, but 
there is no explanation about how these rules were obtained beyond following 
the pattern shown by a few particular cases. As for these, Leibniz had already 
obtained dx? = 3x? as his firs example after the introduction of d, and 
dx? = 2x follows by the application of d to his admirable theorem. Then, 
from the firs of these general rules he obtained again some of the preceding 
particular cases: 


i _ 2 __ —_ 
Hence d— or d, x~? will be —2x~* or — and d,/x or d,x'/? will 
x? x 


1 /1 
be —5x!? or > 


Note that in all of Leibniz’ work to this point dx is just the infinitesima 
unit between successive values of x. But at this point he generalized this as 
follows [f. 3"; pp. 56-57; 140-141; 230; 615; 124-125]: 


107 


d 
Let yNx?. and it will be dyn 2x dx therefore oe m 2x. Such reasoning 


x 
is general, and it does not depend on what the progression for the x may be. 
In the same manner this general rule therefore holds: 
dx® 
dx 


xerl 


ne,x*! and in turn ie dxn 
e+1 
From now on the infinitesima distances between consecutive ordinates can 
be anything besides the unit. 
Next Leibniz showed us how to deal with arbitrary equations, even in 
implicit form [f. 3"; p. 57; 141; 230; 615-616; 125]: 


Let there be any equation whatever, for instance 


ay” + byx 4 cx? 4 fx gy thA>no 


and writing y + dy for y, and similarly x + dx for x, we have by omitting 
[terms] which should be omitted another equation: 


ay? + byx cx? fx ey +h | no 
a2dyy + bydx 2exdx + f?d% + g*dy 
no 
+ bxdy 
ady’ +bdxdy+cdx* | n|0 


107 Tn the previous quoted line the circled difference 5-1 is for Leibniz’ own reckoning 
and clearly to be discarded from the fina statement. This equation also contains several 
circled terms, and I have discarded them in compliance with Leibniz’ intention. 
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This is the origin of the rule published by Sluse. 108 


Before moving ahead we must interpret the last equation. All the terms to the 
left of the brace are those that result from replacing x and y by x + dx and 
y + dy and then expanding. The terms in a half box on the top line add up 
to zero because of the original equation, which is the equivalent of Fermat’s 
“remove the common terms,’ Barrow’s “reject all... terms which do not 
contain a or e,” or Newton’s “removing equal terms.” The terms in another 
half box on the bottom line add up to approximately zero because they are too 
small compared to the rest. From those that remain, the central group, dy/dx, 
can be found. 

Accompanying Leibniz on his discovery trip up to this point may lead some 
readers to think that his main contribution to the calculus was notation, nota- 
tion, notation.!°? However, in this paper of November 1676 we fin the fol- 
lowing important contribution to the calculus [f. 3”; p. 58; 142; 231; 616; 126]: 


Furthermore it will be worthwhile to adjust this kind of work to irrationals 
and compound fractions. 


dVa+bz+cz2, put atbz+crnx 


108 | eibniz used dx and dy (except that he forgot to overline the y in one instance) where 
Newton had used xo and yo and Barrow before him—in a similar Example | in Lec- 
ture X—had used a and e. Child thought [p. 125] that the similarity of Leibniz’ words 
omissis omittendis for “omitting which should be omitted” to Barrow’s rejectis rejiciendis 
is indicative of Leibniz’ debt to Barrow. In the letter to Collins of 10 December 1672, from 
which Leibniz made an extract while in London, Newton stated: “I am heartily glad at the 
acceptance w“ our Reverend friend Dr Barrow’s Lectures find with forreign Mathemati- 
cians, and it pleased me not a little to understand that they are falln into the same method of 
drawing Tangents with me.” Then, he proceeded to quote an example of what these foreign 
mathematicians had done. It is not inconceivable that, on reading Newton’s letter, Leibniz 
rushed to his copy of the Lectiones geometrice and used Barrow’s method after replacing e 
with dx and a with dy. But he had already considerable practice with his differentials, so 
he may have produced this proof on his own and then checked with Barrow and picked up 
some similar language in his explanation. Or this similarity may be coincidental. 

109 Such readers would not be alone, for Newton himself once said: “But certainly no man 
would call this Notation a new method of Analysis . .. ” Quoted from “Counter observations 
on Leibniz’ review [of W. Jones, Analysis per quantitatum series, fluxiones, ac differentias: 
cum enumeratione linearum tertii ordinis],” in Whiteside, The mathematical papers of Isaac 
Newton, Il, 1968, p. 273. Ironically, Leibniz himself seems to have contributed to this view 
when, referring to himself in the third person, he wrote: 


Furthermore it is now to be explained how Our [friend] arrived gradually to a new 
Notation, that he called the differential calculus. 


In Historia et origo calculi differentialis [pp. 13-14; 404; 49; 16]. 
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and 
d2/; 1 
sa _ > now —1b+2cz 
dx 24/. 
therefore 
b+2 
dVatbz+e2n ue 110 


2dz, Va+bz4+ c2z2 


What we have here is an example of the chain rule in its present form, although 
a similar device had already been used by Gregory and Newton. 

It was at this time, in an untitled letter of 21 June 1677, that Leibniz sent to 
Oldenburg, to be forwarded to Newton, that he firs made “public” his recent 
discoveries on the calculus. It was a reply to Newton’s of 24 October 1676, 
containing a lengthy explanation of his discovery of the binomial theorem but 
very little on his new calculus, that Leibniz had just received with a great delay 
due to his moving from Paris to Hanover. Leibniz’ reply was full of praise for 
Newton and included all of his knowledge about the calculus at that point. 

He started with a definitio of his differentials [pp. 154; 241; 168; 213; 
219-220], calling 


dy ... the difference of two close [values of] y ... anddx ... the differ- 
ence of two close [values of] x ... 


and continued with an example of “implicit differentiation,’ very similar to 
the one described above, stating that this method “is useful at a time, when 
irrational [quantities] intervene” [p. 155; 242; 170; 214; 220]. Next he in- 
cluded the evaluation of the differential of </a + by + cy? by the method of 
the chain rule [pp. 155-156; 243; 170-171; 214; 221], define a differential 
equation as one in which x is to be obtained from dx and provided an example 
[p. 156; 243; 172; 215; 221], and described the method of inverse tangents 
[pp. 158-159; 244—246; 175-178; 216—217; 223-224], claiming that it was 
generally more powerful than the method of infinit series. In one of the in- 
cluded figure he used subscripts in labeling all its points [Fig. 29 at the end 
of the volume; p. 244; 174; 216; 219]. 

But it was not until July 11, 1677, in a manuscript on a “General method 
to obtain the tangents to curves without calculation, and without reducing 


110 Gerhardt’s version in Die Entdeckung and in Die Geschichte, Child’s translation, and 
the Sdmtliche follow the original manuscript, which contains two wrong signs and the dz 
in the denominator. However, Gerhardt’s version in Der Briefwechsel has corrected these 
errors and introduced two new ones: dz/dx instead of dx /dz andno dz in the last quotient. 
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irrational or fractional quantities,” written in French instead of Latin,'"1 that 
he gave the complete set of rules of his differential calculus. By that time 
Leibniz could “differentiate” sums, differences, products, quotients, powers, 
and roots. After giving a long example on “implicit differentiation,’ much as 
he had given in the previous paper, he stated [f. 3”; p. 61; 144; 130]: 


Xr 
Now let the formula or equation, or magnitude w, be equal to —, I say that 
LL 


d@ will be equal to 
pdk—rAda 
a Ta: 
This will be sufficien to deal with fractions. 
Finally let w be equal to [replaced by] </@. I say that d@ will be equal to 


d@ 112 
2 VB 
which will suffic for the proper treatment of irrational magnitudes." 


It can be said, and it has been said, that Leibniz offered these and similar 
formulas without proof. This may be technically correct, but proofs are not 
necessary because they can be easily supplied by the reader using the same 
procedure that Leibniz had used to justify the tangent method. To wit, in the 
case of a quotient, if = A/, then ww = A and we would have (u+du)(o+ 
dw) = 4+ di. Now, omitting what can be omitted, as Leibniz would have 
put it, or removing common terms and rejecting products of differentials, this 
boils down to «dw + wd = di, which is the product rule,""4 and then 


dxX—a@du pwdrA—-Adu 
do = = 5) ; 
lL a 


111 Méthode générale pour mener les touchantes des Lignes Courbes sans calcul, et sans 
réduction des quantités irrationelles et rompues. z 

112 This differential is in error. The root in the denominator should be Ne. 

113 The overlines in Child’s translation are different throughout this quotation. In Die 
Entdeckung and in Die Geschichte the letter w in the square roots is not overlined, but it is 
in the manuscript. 

114 Newton had already given a form of the product rule in an Addendum to the fluxiona 
treatise, probably written in the winter of 1671-1672. After stating it in verbal form as 
Theorem | of the Addendum, he stated: “Let there be A: B = C: D [where A to D are 
fluents] and then A x f(D) + D x f(A) = B x f(C)+C x f(B) [where f denotes 
fluxion] This can be demonstrated in the same manner as the solution of Problem 1, or, 
alternatively, this way.” At which point he gave a geometric demonstration. See Whiteside, 
The mathematical papers of Isaac Newton, Il, pp. 330-333. 
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which is the quotient rule. The rule for the root can be easily obtained from 
his newly developed chain rule. 

In this manuscript Leibniz went on to give an “Algorithm of the new 
analysis for maxima and minima, and for tangents” in which, surprisingly, no 
mention is made of either maxima or minima. Then he revised the firs draft of 
this manuscript, possibly with a view to publication or to be included in some 
correspondence. In the revision, retitled Methode nouvelle des Tangentes, ou 
de Maximis et Minimis. ita ut non sit opus tollere irrationalitates, he gave 
credit to Fermat, Descartes, Hudde, and Sluse, but did not refer to Barrow at 
all [f. 1"; p. 62; 145; 131]. 

There is another manuscript written before Leibniz’ firs publication that 
never made it to print in his time. One remarkable thing about it is the manner in 
which he expressed himself on these matters, using subscripts for successive 
points on a curve or axis, providing proofs of some of the formulas of the 
differential calculus, and feeling sure of himself and of his new calculus at 
that time (the manuscript is undated but it may have been written about 1680). 
This shows in the grandiloquent title of the manuscript: “The elements of 
the new calculus for differences and sums, tangents and quadratures, maxima 
and minima, dimensions of lines [rectification] surfaces, solids, and for other 
things that transcend common means of calculation.”1'5 Some excerpts will 
show the polish of his ideas at this time. Leibniz was getting ready to publish. 

On tangents he had the following to say [f. 1"; pp. 149-150; 137]: 


Let CC be a [curved] line whose axis is AB, let BC be ordinates normal 
to this axis, to be called y, the abscissz on the axis AB to be called x. now 
the Differences CD of the abscissz will be called dx such are ;C, D, »C2D, 
3C3D etc.11® On the other hand the straight lines ; D2C, 2D3C, 3DaC (the 
differences of the ordinates BD) will be called dy. If now these dx and 
dy are taken to be infinitel small, ... then it is clear that the straight line 
joining these two points, such as 7C,C, ... when produced to meet the axis 
at ,T, will be tangent to the curve, and ;7;B ... will be to the ordinate 
1 B,C, as {C,D is to ;D2C or if ;7)B, or 27B, etc. in general called ¢ 


118 Flementa calculi novi pro differentiis et summis, tangentibus et quadraturis, maximis et 
minimis, dimensionibus linearum, superficierum, solidorum, aliisque communem calculum 
transcendentibus. 

116 | am using subscripts for convenience, but Leibniz just wrote (mostly) smaller numbers 
with the same baseline as the letters, as in 1C 1d, 2C 2d, 3C3d, actually using here a lower- 
case d. The manuscript figur also shows an additional curve from A to the line 4B4C, 
which serves no purpose. It was not shown in Die Geschichte and it is not shown here. 
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then t: y::[=]dx: dy. And thus to fin the differences of series is to fin 
tangents. 


In other words, to fin the slope of the tangent at a point, y/t, one must fin 
the quotient dy/dx, so that it is imperative to learn how to fin dy at a given 
point of what we now call a function of x. 

Turning to quadratures, he wrote (if I interpret Leibniz’ punctuation cor- 
rectly) [f. 1"; p. 150; 138]: 


Furthermore differences are the opposites of sums. therefore the ordinates are 
the sums of their differences, so 4B4C 117 is the sum of all the differences, 
such as 3D4C, 2D3C, etc. up to A. even if they are infinit in number, which 
I describe as f dy equ. to y. Also the [area] of a figure I describe by the sum 
of all the rectangles [obtained] from the ordinates times the differences of the 
abscissz, such as |B; D+ .B>D + 3B3D. etc. On the other hand the small 
triangles ;C; D.C, 2C2D3C, etc. since they are infinitel small compared 
with the said rectangles, may be omitted with impunity, thus I denote in my 
calculus the area of the figur by { y dx. or the sum of the rectangles, formed 
by each y, times the corresponding dx. 


Then, “soaring high” (altius assurgentes), he obtained the area between a 
positive curve and the x-axis as follows [f. 1"; pp. 150-151; 138]: 


. let the given curve of the figur_to be squared be EE,''® of which the 
ordinates are E B ''9 which we call e. They are proportional to the differences 


117 The manuscript has 4B4B. 

118 No figur was provided by Leibniz, but the one included here is drawn in his own style, 
with the origin at A and the direction of the positive x-axis down. For clarity, there are two 
implied positive y-axes, one directed to the left for the curve EE and one to the right for 
the curve CC. 

119 The manuscript has ED. 
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of the ordinates BC, or to dy, or let .; B) E: 2B) E :: [=]; D2C :2D3C, and 
so on, or let as A; B is to ; B,C, or, as ;C; D is to | D2C, or as dx is to dy be 
as a constant or always permanent straight line [segment] a is to | B) E ore. 
then we have dx : dy::a:e or edx equ.ady Therefore { edx equ f ady 


To interpret this result, we split the figur in two and rearrange the parts 


as shown next, eliminating some of the clutter. We have also taken the liberty 
to replace Leibniz’ B with x. Then we can restate the result in present-day 


y=F (x) 


e=f(x) 


A A 
x T x 


terms as follows. Let the curve EE represent a positive function, which we 
shall denote by e = f(x), and let y = F(x) bea function with graph AC such 
that for some constant a and any x, 
dy e 
dx a 


Then the area under the graph of e from A to x is 


/ edx =ay =aF (x). 


A 
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Ifwetake a = 1 andnote that F(A) = 0, which Leibniz did not state explicitly, 
the result is that if F’(x) = f(x) then 


[ feoar= F(x) — F(A), 
A 


the fundamental theorem of calculus. Of course, this is nothing new at this 
time. This result had been frequently used by Newton, who had not published 
anything on the calculus yet, but it had been proved in print by both Gregory 
and Barrow. In fact, Leibniz’ presentation is quite similar to Barrow’s except 
for the differential and integral notation. Perhaps this is precisely what Leibniz 
intended: to show how a known geometric result can be expressed in his new 
calculus. 

Further ahead in the same manuscript, he discussed some of the other 
topics promised in the title [f. 1”; pp. 151; 139-140]: 


... for ;C2C is equ. to /dx -dx + dy-dy. From this we have at once 
a method for findin the length of the curve by means of some quadrature, 
and then [ff. 1’—2"; p. 152; 141]: 


On the other hand if it is required to fin the centers of lines, and the surfaces 
generated by their rotation, ex. gr. the surface [generated] by the rotation of 
the line AC. about AB., we must only fin 


[> dxdx+dydy ... 


Therefore the [whole] thing is at once reduced to the quadrature of some 
plane figure if we substitute for y and dy the values [obtained] from the 
nature of the ordinates and the tangents to the curve. 


As for the differential calculus, he mostly repeated what he had already 
written in the July 1677 manuscript, but this time he included proofs. For 
instance, he found that [f. 2”; p. 154; 143] 


dxy equ x+ dx times y+ dy — xy or xdy + ydx + dx dy and the 
omission of the quantity dx dy, which is infinitel small with respect to the 
rest, given that dx and dy are infinitel small, will produce xdy+ydx ... 


For the quotient rule he computed 
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y ytdy y x dy — ydx 499 
d-—e or 
x qn xtdx x xx +xdx 


whereby writing «x for xx+x dx since x dx can be omitted as being infinitel 
small with respect to xx makes: 
xdy — ydx 


Then he found the differentials of xyv (what is known today as Leibniz’ rule) 
and y/(vz), and concluded with the differentials of powers and roots. 


5.14 THE ARITHMETICAL QUADRATURE 


The longest mathematical manuscript of Leibniz, containing 51 propositions, 
did not even mention the calculus or differentials. But at the core of the 
manuscript, he proved rigorously that the area under a curve can be approx- 
imated as closely as desired by what we now call “Riemann sums.” It was 
written in 1675-1676, shortly after a time in which he thought of dx as an 
infinitesima unit. However, in De quadratura arithmetica circuli ellipseos 
et hyperbole cujus corollarium est trigonometria sine tabulis (The arithmeti- 
cal quadrature of the circle, the ellipse, and the hyperbola whose corollary is 
trigonometry without tables) he showed tremendous maturity and progress, 
and for this reason it seems that its placement in this overview of his work 
should follow that of the 1680 work discussed in the previous section. Leibniz 
left the manuscript in Paris before his departure for Germany in October, hop- 
ing that its publication could be used to secure him a position at the Academy 
of Sciences. He left the manuscript, together with some drafts for corrections 
and extensions, in the hands of his friend Soudry, who would be in charge of 
preparing the fina copy. 

Soudry, unfortunately, died suddenly in 1678, and the manuscript remained 
in possession of Friedrich Adolf Hansen, who had discussed its publication 
with Soudry. He then put it in the hands of Christoph Brousseau, from Hanover 
but a resident of Paris. Brousseau, in turn, passed it to Isaac Arontz, who was 
planning to travel to Hanover, for delivery to Leibniz. This happened in 1680, 
while Soudry’s copy became and remains lost.‘ 


120 The word “or” and the numerator of the last fraction are crossed out in the manuscript. 
121 These facts about the whereabouts of the De quadratura arithmetica manuscript are 
from Knobloch, “Leibniz et son manuscrit inédite sur la quadrature des sections coniques,” 
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After unsuccessfully attempting to publish the original manuscript, Leib- 
niz gave up, and later he refused to publish it because it had become anti- 
quated. Instead, he wrote an abbreviated version, Compendium quadrature 
arithmetice, using differential and integral notation, but it contains only a few 
sketches of proofs. 

Before stating Proposition VI, which is at the heart of the manuscript, 
Leibniz set the scene with some necessary preliminaries based on a very de- 
tailed figure The most interesting part of this figur for us includes the corner 
near the origin at A, which is too small a portion of Leibniz’ larger figur to 
comfortably distinguish points, letters, and segments. For this reason, we have 
taken the liberty to greatly enlarge and somewhat distort a portion of the figur 
containing this corner, as shown on the next page, and we have kept only those 
elements that are relevant in the following discussion. The positive x-axis is 
located vertically down from A and the positive y-axis horizontally to the right 
of A. As we well know by now, Leibniz placed his subscripts on the left of their 
letters. However, the propositions and proofs that we are about to present are 
somewhat lengthy, and such an arrangement may be tiring for the present-day 
reader. Thus, in the enclosed figur and in the quotations and explanations 
that follow, we write the subscripts on the right, as is today’s fashion. Then, 
if “CC etc. Cy” is a given curve (we would talk about a continuous function 
that has a tangent at each point), he constructed the tangents CT to the curve, 
and, from each T, the perpendicular to the corresponding ordinate BC gen- 
erates the point D. These points form a new curve “Dj, D> etc. Dy.'22 Next, 
the chords “C;C» etc. up to C3C,4” (here and in what follows, “up to” means 
to the end of the curve, however many C’s are placed on it, and it was just 
convenience for Leibniz to end at C3C4) cut the right angle TAB at [f. 8”; 
p. 28; 49; 8]'28 


points M. that fall between the points T, as M, is between 7; and 7), and 
Ms; is between 73 and 7. and similarly from these points of intersection M 
drop other perpendiculars M,N, P;, up to M3N3P3... 


1986. This paper also contains proofs of all these assertions. Knobloch’s analysis can also 
be seen in the introduction to his 1993 edition of De quadratura arithmetica. 

1221 eibniz used a great deal of underlining in the De quadratura arithmetica and, in 
particular, in his presentation of Propositions VI and VII, of which this is an example. | 
have decided to omit it from this point on, but the interested reader can see it in Knobloch’s 
edition. 

123 Page references are to manuscript folio, to Knobloch’s edition of the De quadratura 
arithmetica, 1993; to Parmentier’s French translation Quadrature arithmétique, 2004; and 
to Hamborg’s German translation Uber die arithmetische Quadratur, 2007. 
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to the ordinates B, Ni, up to BaN4. 
Then the text of Leibniz’ Proposition VI is as follows [f. 8"; p. 29; 51; 
8-9]: 


Thus these rules and preparations [concluded], J say that we can consider in 
the [two] curves points C between C, and C4 and points D between D, and D4 
in such number and so close to each other, that the rectilinear gradiform space 
NB, Bg P3N3 P2N2 P, N, composed of the rectangles N, B, Bz P,, and others up 
to N3B3B4P3 which are comprised between the ordinates N, B, or if needed 
their prolongations, and others up to N3B3, and by the intervals B, Bz, and 
others, up to B3B4.; differs by a quantity smaller than any given [quantity] 
whatever from the quadrilinear space D, B, B4D4D3 etc. D, (delimited by the 
axis B, Ba, the outside ordinates B,D, B4 D4 and the new curve D, D2 etc. D4). 
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And the same demonstration holds for any other mixtilinear and gradiform 
space formed by the continuous application of straight lines on a certain axis. 
One can take it as proved that the areas of figures can be obtained from the 
sum of straight lines, as addressed by the method of indivisibles. It is required 
that the curves or at least the parts in which they are split, have their concavity 
on the same side, and lack reversion points. 


Reversion points (Puncta reversionum) were immediately define by Leib- 
niz as those at which the ordinate is tangent to the curve. After this, he provided 
a proof in eight stages, which he numbered in parentheses and later referred 
to as articles. 

In (1), the fact that the arc D; D2 has no reversion points allowed Leibniz 
to construct the point F| as the intersection of the segment Nj) P;, parallel to 
T, D,, with the arc D; D>. Then he added [f. 9”; p. 30; 53; 10]: “In the same 
manner the other portion Dz D3 will cut the straight line NP, at Fo. etc.” 

In (2) he extended each segment T D until it intersects the next ordinate at E. 
He called each rectangle N; B; Bz P}, Nz B2B3 P2, ... an Elementary Rectangle, 
each rectangle D| E; Dz, D2 E2D3, ... the complementary rectangle (he left 
it to the reader to imagine the remaining vertex of each of these rectangles). 
Leibniz asserted that the [absolute value of the] difference between the partial 
Quadrilinear D; B; ByD2 Dj, and its elementary rectangle is smaller than the 
corresponding complementary rectangle, which in current notation we write 
as 

|D, B, Bo D2D, _ NB, BoP | < D,E, Dp. 


He proved this by noting that the partial quadrilinear and the elementary rect- 
angle have in common the Quintilinear space D; By By P; F; D;. Subtracting it 
from each leaves the trilinear spaces D> P; F; Dz and D; Nj F\ D,, respectively. 
Thus, in today’s notation, 


|D) B, By D2 D, — NB, B2P\| < |D2P) Fi) D2 — Di N\ F\ DI. 
In (3) [f. 9”; p. 31; 55; 11] Leibniz stated: 


Then it will suffic to show that the difference between these two trilinears 
is smaller than the complementary rectangle D; E; D2, which is clear... 


and then explained how this is clear. 

In (4) [f. 9; p.31; 57; 11-12] he stated that the same is true for the difference 
between the next partial Quadrilinear DB, B3D3D> and the corresponding 
elementary rectangle Nz B2B3[P2], and for the others. Then the difference 


344 THE CALCULUS CHAPTER 5 


between the total Quadrilinear D; B, By D4 D3 etc. and the rectilinear gradiform 
space (spatio rectilineo gradiformi) N, B,B4P3N3P2N>P,N, is smaller than 
the sum of all the complementary rectangles. In current notation, and if there 
are n quadrilinears, this means that '24 


n 
S* [D: Bi Bi+1Di+1 Di — Ni Bi Bi+1 Pi] 


i=1 


n 
< Y > Dj Ej Dis. 
i=1 


In (5) Leibniz observed that the rectangles in the last sum have bases 
D4sE3 = D4L3, D3E. = L3L2, up to DoE; = L211, and heights Dj EF; = 
B, Bz, DoE. = B2B3, ... If the heights B; Bz, B2B3, ... are all equal, then 
the sum of the areas of these rectangles equals the product of the sum of the 
bases, D4L,, and B3B4. Otherwise, the sum of these areas is smaller than the 
product of the sum of the bases and the largest of the heights. If we assume 
that the largest height is the last, B3B, = Ww D4, then the sum of the areas of 
all the complementary rectangles is less than or equal to that of the rectangle 
WD4Ly. 

In (6) Leibniz simply stated that it follows from articles 4 and 5 that the 
difference between the total Quadrilinear [D)B, B4D4D3D Dy, | and the grad- 
iform space [N; B; B4 P3N3P2N2P,N,| is smaller than the sum of the comple- 
mentary rectangles, and therefore smaller than the rectangle w D4L}. 

In (7) [f. 9°; pp. 31-32; 59; 12—13] he noted that the height of the rectangle 
w D4L, can be assumed to be smaller than any given quantity, since the D 
points can be taken arbitrarily close to each other, and in any number whatever. 
Thus, the area of the rectangle yD4L, can be made arbitrarily small. 

Finally, in (8), Leibniz concluded that it follows from articles 6 and 7 
that the difference between the total Quadrilinear [D)B, B4D4D3 DD | and 
the gradiform space [N B; B4 P3N3 P2N> P;N,| can be made smaller than any 
given quantity. 

This is the modernity of Leibniz’ thought and methods. The “infinitesima 
unit” of his earlier tentative manuscripts is nowhere present in this treatment. 
Instead, this is a modern proof by reductio ad absurdum of the integrability 
of a large class of functions, and Leibniz was justifiabl proud when he stated 
that [f. 10”; p. 35; 71; 18] 


I believe none is simpler and more natural, and closer to the direct demon- 
stration, than that which not only simply shows, that there is no difference 


124 This makes implicit use of the triangle inequality. Leibniz had already established a 
result that amounts to the triangle inequality as Proposition V of this tract. 
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between two quantities, so that they are equal, (which elsewhere is usually 
proved [by showing] that neither is greater nor smaller than the other by a 
double reasoning) but one that [uses] only one middle term, namely either 
inscribed or circumscribed, not both simultaneously[.] 


We could not have deflate Leibniz’ joy by pointing out that there is some 
unfinishe business in this work, because he had already anticipated us and 
finishe that business when he wrote this assessment. The apparently unfin 
ished business is this: we wanted to approximate the area “under” CC) etc. 
C4, but we have only approximated the area “under” D; D2 etc. D4. Leibniz 
took care of this in Proposition VII.'% If we agree to call C; C2 etc. C4 the firs 
curve and D, D> etc. D4 the second curve, the conclusion of this proposition 
reads [f. 10"; p. 33; 65; 15]: 


. the space comprised between the axis (on which the ordinates are 
drawn,) the two outside ordinates, and the second comprising curve, is double 
the space comprised between the first curve and the two straight lines whose 
ends are joined to the given right angle. 


Leibniz explained that these two right lines are the segments AC; and 
AC; [f. 10"; p. 34; 67; 15]. Thus, the proposition states that the area of 
the quadrilinear figur Dj B,B3D3D 2D, is twice that of the trilinear figur 
C,AC3C2C, (in either case, D3D2D, and C3C2C, are arcs of curve). The 
proof, which could have used any number of ordinates rather than just three, 
was done in fi e stages. 

In (1) Leibniz assumed that the conclusion does not hold, and denoted by Z 
the difference between twice the Trilinear area and the Quadrilinear one. Then 
constructed the polygons AC;C2A, AC; C2C3A, and so on, reminded us of the 
construction of the points M on the Y-axis and of the perpendiculars MN, and 
then denoted their intersections with the ordinates B2C2, B3C3, etc. by S$, So, 
etc. These were previously called P, so a new figur is not necessary. 

In(2) [f. 10"; p. 34; 67; 15] he assumed that the construction of the inscribed 
polygons is such that 


the difference between the polygon AC, C,C3A and the Trilinear C; AC3C2C 
[recall that C3C2C, is an arc of curve here], as well as the difference be- 
tween the rectilinear Gradiform space B; N; S; Nz Sz B3 B,, and the Quadrilin- 
ear D, B, B3D3D 7D, [same for D3D2D,], each one separately, is smaller, 


125 | eibniz had already found this proposition in May 1673. See Scimtliche Schriften und 
Briefe, Ser. Il, 1, p. 115. 
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than the fourth part of Z. We can in fact continue the very same [process] 
until [each] is smaller than any given quantity. 


In (3) Leibniz used Proposition 1 of this tract to show that [f. 10”; p. 35; 
69; 17] 


twice the triangle AC; C2 is the rectangle [N, B, Bz] and twice the triangle 
AC2C; is the rectangle [NB B3], and so of the others ... 126 


Therefore, the sum of all the rectangles of this type, which is the Gradiform 
space, will be twice the sum of all the corresponding triangles, or the inscribed 
polygon. 

In (4) he denoted the Quadrilinear [D, B, B3D3D Dj] by Q, the inscribed 
polygon [AC,C2C3A] by P, and the Trilinear [Cj AC3C2C , where C3C2C| is 
an arc of curve] by T. The difference between Q and the gradiform space, 
whose area equals 2P by Article 3, is smaller than Z/4 by article 2. Also by 
article 2, the difference between P and T is smaller than Z/4. Leibniz did not 
use absolute values, but if we do, all this means that 


|Q —2T| <|Q—2P|+|2P —2T| < 4274+ 4Z = 3Z. 


Finally, Leibniz remarked in (5) that comparing this inequality with the 
definitio of Z in article 1 shows that |Q — 27| is smaller than itself, which is 
absurd. Thus, the original assumption cannot hold, and Q = 27. 

What this result does is allow us to compute the area AB3C3C2C| A, be- 
tween the C curve and the x-axis, as the sum of T = 5 Q and that of the triangle 
AB3C3A, and the method is applicable if Q is easier to compute than the orig- 
inal area. Appropriately, in his Historia et origo calculi differentialis, Leibniz 
referred to this procedure as the transmutation method (Methodo transmuta- 
tionum) [p. 11; 402; 44; 13]. 

This method and much additional work allowed Leibniz to perform the 
general quadrature of ellipses and hyperbolas in Proposition XLHI of De 


126 We can use his argument while keeping the notation now in use as follows. Prolong 
CC, M, in this direction and drop a perpendicular AH to this prolongation (the reader is in 
charge of drawing the figure) Now prolong B, N;C; to the right and drop a perpendicular 
C2G to this prolongation. The triangles AM, H and C2C,G are similar because /AM, H = 
ZC2C 1G. Then, 

AH C2G AH B, By 

AM CiC) ” BN CiC 
so that B} N; x By, By = AH x C,C2. The right-hand side 1s twice the area of the triangle 
AC,C? that has base C; C2 and height AH. 
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quadratura. We have neither the space nor the desire to embark on such a 
long trek. Instead, to give an example of the method of transmutation in the 
particular case of the circle, we follow a shorter route, as Leibniz himself did 
at the end of 1675, in an untitled and unsent letter in French to Jean-Paul de 
La Roque, editor of the Journal des scavans.'*’ Leibniz provided essentially 
the same figur as in De quadratura but different notation, but it is simpler if 
we use the same notation as above. He assumed that the curve AC) C2C;3 is 
an arc of the circle of radius a centered at x = a whose length is no larger 
than that of a quadrant, and stated that (recall that we are replacing his new 
notation with the present one) [p. 91; 349] 


Since the curve AC; CC; is an arc of circle, the curve of intercepts, namely 
AD, D2 D3, can be related to the right angle BAT, by this equation 


2az* 


= x, 

a +e 

where z represents the ordinate BD. Leibniz did not explain how he obtained 
this equation, but some explanation is preferable. Note that we would write 
the equation of the circle as (x — a)? + y* = a’, which gives y? = 2ax — x?. 
Then, according to Leibniz’ own rules of the calculus, 2y dy = 2a dx —2x dx, 


or 
dy a-x 


dx iy 
If we now imagine a small characteristic triangle at C3, with legs dy and dx, 
it is clearly similar to the triangle C3 D373, and then 


ay yrs 
dx x 


Therefore, solving for z, using the quotient previously obtained for dy/dx, 
and the expression given above for y7 yields 


dy =e. yee ta ax a’x 


L= X =y Xx = = — S 
of dx ~ y y y 2a—x 


Squaring and solving for x provides the equation given by Leibniz. 


1271 eibnizhad firs communicated the quadrature of the circle, but without proof, to Henry 
Oldenburg in the summer of 1674. It also appears as Proposition X in a communication to 
Huygens of October 1674. See Sdimtliche Schriften und Briefe, Ser. Il, 1, pp. 117 and 165. 
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Direct use of the transmutation theorem calls for the evaluation of the area 
Q = AB3;D3D)D,A, but Leibniz did this indirectly, computing firs the area 


2az? 
AT;D3D2D\A = | xdz = fog dz 


(these equations are for our convenience, because Leibniz did not use the 
integral sign in this letter or in De quadratura). To do this he employed 
[pp. 91-92; 349-350] 


the beautiful method of Nicolaus Mercator [long division] according to which 


since a is unity, and 5 equal to the same x[/2] will be equal, to 


Zz 
142? 
2? — zt +76 — 28 ete to infinit . And the sum of all the x[/2], equal to the 
sum of all the z* — z+, etc. now... the sum of all the z? [from z = 0 toz = D] 

2 5 


b 
will be — and the sum of all the z* will be — etc. (by the quadrature of 


parabolas) hence the sum of all the x, or the space AT; D3[D2Dj,]A, or the 
difference between the rectangle T;A B3[ D3], and twice the segment of the 
3 


ircle, A[C,C>]C3A, will b ie A a 
circle, , will be | etc. 
ae aie MA i 
Accepting that the radius of the circle is a = 1 and writing z instead of 


Leibniz’ b for an arbitrary ordinate BD, these statements can be rendered in 
current notation as follows. First, the “sum” of all the quotients x /2 is 


[ye -| z ee oe aan 
glia a Say 


and then, noting that the area of the rectangle T;AB3D3 is xz and that the 
circular segment AC;C2C3A was denoted by T, Leibniz observed that 


fxae=az-21. 


It follows from Lemma VII of De quadratura, a result included as the main 
theorem in Leibniz’ letter, that 


XZ - 2 z! 


pO =T=— [pe= oe 
LG Zo fs 5 oD 


and adding to this the area of the triangle AB3C3A = sxy gives the desired 
quadrature 


/ d XZ + xy 3 a z! 
‘o— 
° 2 ao 3. “9 
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Although it is possible to rewrite the firs fraction in terms of z only, it would 
not add to the meaning of the result. Just observe that at the point (1, 1) of this 
circle its tangent is vertical, so that the corresponding value of z is 1. Putting 
x = y =z = | in the preceding equation gives the area of a quadrant of the 
circle, which is 7/4; that is, 


=1 
4 a” 5 77 


Leibniz obtained this identity in De quadratura in the following form [f. 24’; 
p. 79; 219; 82]: 


PROPOSITION XXXII 


The Circle is to the circumscribed Square, or the arc of the Quadrant is to the 
Diameter, as 


is to unity. 


Clearly, Leibniz meant the areas of the circle and the square and the lengths 
of the quadrant and the diameter. This expansion is known as the Leibniz 
formula, and also sometimes as the Gregory—Leibniz formula. '7® 


5.15 LEIBNIZ’ PUBLICATIONS 


Leibniz may have been ready to publish by 1680, but he also seemed somewhat 
reluctant, at least judging by the amount of time he let pass before he finall did 
so, perhaps fearing that his infinitesima quantities may not be well received. 


128 In a letter of 15 February 1671 to John Collins, Gregory wrote: “Sit radius = r, arcus 
= a, tangens = f, secans = 5, erit 
e “ pf? ‘ ad 
=t ” 

“ 3r2 0 Sr4Tr6 98 
(quoted from Turnbull, James Gregory tercentenary memorial volume, p. 170 and The 
Correspondence of Isaac Newton, 1, 1959, p. 62). Putting r = 1 andt = 1 fora = 2/4 
gives the Leibniz formula, but Gregory never wrote it. 

However, the Leibniz and Gregory series had already appeared in Jyesthadeva’s Yuk- 
tibhasa and in the anonymous Tantrasangrahavyakhya, and are very likely the work of 
Madhava. For details see Rajagopal and Rangachari, “On an untapped source of medieval 
Keralese mathematics,” 1978, pp. 91-96. 
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But he had essentially completed his investigations and he may have been 
concemed that his friend and colleague Ehrenfried Walter von Tschirnhaus 
(1651-1708) might get ahead of him and publish his own thoughts on the 
subject. When Leibniz helped some friends found the journal Acta Eruditorum 


MENSISOCTOBRISA.MDCLXXXIV, 467 
NOVA METHODFS PRO MAXIMIS ET ML 
nimis , itemque tangentibus, que nec fratizs, nec irrati- 
onalesquantitates moratur, 6 fingulare pro 
ils calculi genus, per G.G.L. 
Gleaxis AX, dc curve plores, ut VV, WW, YY, ZZ, quarum ordi- TAB, XU. 
‘nate, ad axom normales, VX, WX, YX, ZX, qu vocentur refje- 
clive, v, vv, ¥, 2; & ipfa AX ableiflaab axe, vocetur x, Tangentes fint 
VB, WC, YD, ZE axi occurrentes refpective in pundtis B, C, D, Es 
Jamrecta aliqua pro arbitrio allumta vocerur dx, & reéta que fic ad 
dx, ita (vel vy, vely, velz) etad VB (vel WC, vel YD, vel ZE) vo- 
cetur de (veld vv, veldy vel dz) five differentia :plarum w (vel ipfa- 
tum vv, aut y, autz) Hispofitis calculi regylz crunt tales: 
Sica quantitas data conftans, erit da xqualiso, & d ax crit eque 
adx:fifityxqu. 2 (feu ordinataquayis curve YY , equalis cuivis or- 
dinate refpondenti curve VV )eritdy equ, dy. Jam Additio & Sub- 
trade: fi fitz-y{ ywtx xau.v, crit da-ytwixftudr, qu. 
dz -dytdvvt dx, Afuhiplicario, dxvxqu.xdytodx, feu pofire 
Yaqu. xv, fierdy equ. xdy ft edx Inarbitrio enim eft vel formulam, 
ut xp, vel compendio pro calitcram,uty,adhibere, Notandum &x 
& dxeodem modo in hac calculo traétari,ut y & dy,vel aliam hiteram 
indeterminatam cum fua differentiali, Notandum etiam non dari 
femperregrefluma differentiali Aquatione, nifi cum quadam cautio- 


P ” 
ne,dequoalibi. Porro Divifio, d—vel (pofitoz equ, )dzzqu 
tedytydy + 


Y x 


"Quosd Sigrahoc probe notandum , cum in calcula pro litera 
fabftituitur fimpliciter ejus differentialis, (ervari quidem eadem figna, 
& pro +2 feribitdz, pro--z feribi--dz, ut ex additione & fubtra~ 
Gione paulo ante pofita apparet; fed quando ad exegefin valorum 
venitur, feu cum confiderarur ipfus zrelatio ad x , tunc apparere, an 
valor ipfius dz fit quantitas afirmativa, an nihilo minor feunegativa: 

od polterias cum fit, cunc tangens ZE ducitura punéto Z non yer- 
lus A, fed.in partes contrarias {cu infra X,id eft tunccum ipfa ordinatz 
Nnoj z decre- 


The firs page of Leibniz’ publication on the differential calculus 
From Struik, A source book in mathematics, 1200-1800. 


in 1682, he was invited to submit a paper, and Leibniz chose to write on the 
quadrature of the circle, giving the formula stated above but without a proof. '2° 
Finally, two years later he unveiled his differential calculus, in the October 


129 “De vera proportione circuli ad quadratum circumscriptum in numeris rationalibus, a 
G. G. Leibnitio expressa,” 1682. 
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issue of the same journal, with a paper entitled “A new method for maxima 
and minima, as well as tangents, not hampered by fractional nor irrational 
quantities, and a singular calculus created for them, by G. W. L.”'°° It begins 
as follows [p. 467; 168: 220]: 13" 


Let an axis AX, and several curves, such as VV, WW, YY, ZZ, whose 


la dx 


D 
x 
C 
L 


x 


oV 


2B 


ordinates, normal to the axis, VX, WX, YX, ZX, are called respectively, 
v, w, y, z; & this [portion] AX cut off from the axis, is called x. Let the 
tangents VB,'°? WC, YD, ZE intersect the axis respectively at points B, 
C, D, E. Now some arbitrarily selected straight line is called dx, & the line 
that is to dx, as v (or w, or y, or z) is to XB (or XC, or XD, or XE) '33 is 


130 “Nova methodvs pro maximis et minimis, itemque tengentibus, que nec fractas, nec 
irrationales quantitates moratur, & singulare pro illis calculi genus, per G. G. L.” 

131 Page references are to the Acta, to Opera mathematica and to Leibnizens Mathematische 
Schriften in this order. I follow the Acta’s text except for obvious errors. 

132 Two possible arrangements are shown in the figure labeled ; V; B and V2 B. 

133 The last four segments are incorrectly given in the Acta and in the Opera as VB, WC, 
Y D, and Z E, but the mistake was corrected in the Schriften. The mistake can be attributed to 
Leibniz himself, since the printed version is, on this point, identical to a manuscript draft that 
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called dv (or dw, or dy or dz) or the difference of these v (or these w, or y, 
or z) This posed the rules of calculus are such: 


If a constant quantity a is given, then da equals 0, & d ax will be equ. adx: 
if y is equ. v (or if any ordinate whatever of a curve YY, is equal to the 
any ordinate whatever of a corresponding curve VV) then dy will be equ. 
dv. Now Addition & Subtraction: if z — y + w+ x, is equ. v, it shall be 
dz—y+w +x '™ or dv, equ. dz — dy + dw +dx. Multiplication, dX 
equ. x dv+v dx, or given that y equ. xv, dy becomes equ. x dv-+v dx. In fact 
it is a matter of choice whether we employ a formula, such as xv, or shorten 
it to a letter, such as y. Note that & x & dx are treated in this calculus, in the 
same manner, as y & dy, or any other indeterminate letter with its difference. 
Note also that it is not always possible to go back from a differential Equation, 
except with some caution, of which [we will talk] elsewhere. On to Division, 
tudy + ydv 


d~ or (siven that z equ. to ~) dz equ. 
y y yy 
It is not necessary to continue in detail. In the next paragraph he remarked 
that [p. 468; 168; 221] 


at the very moment when v neither increases nor decreases, but is flat dv is 
exactly equ. 0, ... at this very same place v, namely the ordinate LM, is 
maximum (or ... Minimum), ... 


A few lines below he define point of inflectio (punctum flexus contrarii), 
characterized it in terms of changing concavity, and (carelessly) stated the 
following: 


Therefore a point of inflectio takes place, when neither v nor dv are 0, and 
yet ddv is 0. 


After dealing (not as clearly as one might wish) with the matter of the 
ambiguous signs that have just shown up in the case of division, Leibniz went 
on to deal with powers and roots, very much as in the unpublished manuscript 


is still extant (probably not the one used by the printer, since there are significan differences 
between the draft and the printed version) with the same title as the paper (Bodemann XXXV, 
vol. V / 25, f. 3"). In this translation I use the corrected version. 

134 The overbar is too long in the Acta and it covers the d. This was corrected in the Opera 
and in the Schriften. The same for the product rule below. There are no such overbars in 
the manuscript draft mentioned in the previous footnote. 
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discussed in Section 5.13 [p. 469; 169; 222], specificall stating the equations 
“dx, = a.x9—! dx” and 


ave = = dx NE amis 


Then he named this calculus the “differential” calculus, which is the name that 
we still use today, and then he tooted his own horn for a little while, asserting, 
in particular, that [p. 470; 170; 223] 


it is also clear that our method extends to transcendental curves, which cannot 
be reduced by Algebraic calculation, or do not have a fi ed degree, ... 


But he did not include such an extension in this paper. Leibniz gave next (after 
definin tangent as a line passing through two points of a curve at an infinitel 
small distance) three examples of the differential calculus. He stated the firs 
example after denoting division by [pp. 470; 170; 223—224] 


Logic i x : . 
x : y whichis the same as x divid. by y or —. Let the first or given equation be, 
y 


x:y+ta+bx c—xx:squar. ex + fxx 
+ax/ggtyytyy: JS hh+lx +mxx equ. 0.136 


expressing the relation between x & y, ..., assuming that a, b, c, e, g, h, l, 
m are given [constants]; it is desired 


to fin the quotient dx : dy, to make a long story shorter. 

The stated equation is somewhat complicated and, after solving the prob- 
lem, Leibniz explained that this was done on purpose to show the power of 
the calculus rules. But since we are already familiar with that power, we deal 


135 The Acta used capital X in these equations, except for dx instead of dX. The Opera 
used X throughout, but the Schriften used only x. 

136 The Acta and the Opera reproduced this equation as in the manuscript draft, except for 
using the abbreviation “quadrat.” (translated here as “squar.”) instead of “quadr.” However, 
The Schriften did not place a bar over ex + fxx but placed one over a + bx c—xx. We 
would prefer to write this equation as 


x  (at+bx)(c — x?) y? 
+ + axy/ 9? + y? + = 0. 
y (ext fx")? Vh2 + 1x + mx? 


In one of a large number of (disorganized) enclosures that accompany the manuscript draft 
(on a page numbered 32 in a more recent hand), Leibniz himself wrote this equation using 
fraction bars but no parentheses. 


354 THE CALCULUS CHAPTER 5 


with the third term only as an example, thereby avoiding the use of the quo- 
tient rule. After denoting g* + y* by r, Leibniz found that [p. 471; 170; 224] 
“d,axa/r is + ax dr:2/r + adx,/r ,” 18” which we would prefer to read 


as 
ax 


2Jr 


A few lines below he noticed that “dr is 2y dy” and, therefore, 


d(ax./r) = 77d dy + an/r dx 


d(ax./r) = dr + av/r dx. 


(an equation that Leibniz did not write explicitly). This is an example of the 
chain rule, although not as involved as the one in his November 1676 paper. 

By the new calculus rules and the use of substitutions, Leibniz found the 
differentials of the remaining three terms, expressed them in terms of dx and 
dy only, added the results to the one already found here, equated the sum to 
zero, and solved for dx : dy. 

In the second example Leibniz obtained the refraction law [pp. 471—472; 
171-172; 224—225], and the third, which is the crowning point of this paper, 
is the solution of a difficult long-standing problem, introducing something 
new in what we now call mathematical analysis. The problem had been posed 
by Florimond de Beaune (1601-1652) to Descartes, who could not solve 
it. This is the problem in Leibniz’ words and with reference to his figur 
[p. 473; 172; 226]: 


To fin acurve WW of such a nature that, its tangent WC extended to the 
axis, XC is always equal to the same constant straight line, a. 


Leibniz had firs attempted the solution ina manuscript of July 1676, Methodus 
tangentium inversa, but was not completely successful. Since the slope of the 
tangent is XW/XC = w/a, we have to solve the equation 


dw w 


aot a, , 
which, having set dx equal to a constant b, Leibniz wrote as “w equ. — dw. 


However it is expressed, what we have here is a differential equation, possibly 
the firs one ever in print.1%8 


137 This is the Schriften’s corrected formula. The Acta and the Opera have d, ax./r is 
—axdr:2r./+adx,/r. The manuscript draft has d, xa./r is— xadr:2,/r+dxa,/r. 


138 | eibniz had already define differential equation (@quationem differentialem) in his 
letter to Oldenburg of 21 June 1677 [p. 156; 243; 172; 215; 221]. 
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From a present-day course in differential equations we know that the so- 
lutions are 


w = Ce*!4, or r=ai(2), 
GC 


where C is an arbitrary constant and / denotes the natural logarithm, but there 
was no such course when Leibniz wrote this equation, so he improvised. To 
write his solution, firs in current terminology, consider an arithmetic progres- 
sion of abscissas: x, x +dx,x+2dx,...,andif w is the ordinate corresponding 
to x, the one corresponding to x + dx is 


w b b 
wtdw = w+ dx = w+ 2w=w(14"). 
a a a 


This is how each ordinate is obtained from the previous one, so that they form 
a geometric progression with ratio | + b/a. Recalling now our discussions 
in Chapter 2 on geometric and arithmetic progressions, we conclude that the 
relationship between the given abscissas and their corresponding ordinates is 
logarithmic, which is more simply expressed by the equation x = al(w/C). 

Leibniz’ own solution is much briefer, for, without writing a single ad- 
ditional equation, he simply expressed himself, and concluded his paper, as 
follows: 


. if the x are in arithmetic progression, the w will be in Geometric 
progression, or if the w are numbers, the x will be logarithms: therefore the 
curve WW is logarithmic. 


In this manner, the solution of the de Beaune problem was a feather in the cap 
of the new differential calculus. 

Now let us backtrack to the start of this paper. Let us use our imagination 
and try to place ourselves in the position of one of its readers who was a 
contemporary of Leibniz. Such a reader would not have studied the calculus 
that we have studied and would not have read the manuscripts that we have 
previously discussed at length. What would such a reader make of all this? 
The description of the curves and their tangents seems to be clear, but what is 
all that about dx and dv and the rules of the calculus? The calculus of what? In 
fact, an eminent reader who was a contemporary of Leibniz, Johann Bernoulli, 
a younger brother of Jakob who would become professor of mathematics firs 
at Groningen and then at Basel, wrote down his opinion for the record in 
his autobiography: this calculus of Leibniz was “an enigma rather than an 
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explication ...” 199 


But let us not pursue this line of thought at this moment. Since Newton 
had not published a line on this subject yet, this is how the differential calculus 
was presented to the world. Two years later, Leibniz gave us a little glimpse 
at his integral calculus, but at that time without its present name,'*° although 
he called it “summatorium, aut tetragonisticum” [p. 193; 233]. 

There is too much talk in this paper, but eventually Leibniz got around to 
proving that the quadratures of the circle and the hyperbola, that is, what we 
now call the integrals 


i Va? +x? dx, 
are transcendental, because if they were algebraic [pp. 190; 228-229] 


it would follow with their help that the angle, or proportion or logarithm can 
be cut in a given proportion from straight line to straight line ... 


(nobody ever said that he was a model of clarity). 

It was in this paper that the integral sign appeared in print for the firs time. 
There is a grand total of six integral formulas in it, of which the firs three are 
variants of $xx = f x dx, and the fourth is given as follows [p. 192; 231]: 


Let a be an arc, x its versed sine [that is, x = 1 — cosa], it will bea = 
fdx: 2x —xx- 141 


The next one is about the cycloid, a curve about which we are not as emotional 
today as they used to be back then. The last one, a repeat of the fourth but 
with a misprint [p. 193; 233], is there just to exhort the reader not to forget to 
write the dx. 

Leibniz continued working, writing, and publishing on his new calculus, 
including a proof of the fundamental theorem of calculus that is strongly 


139 Fragments from Bernoulli’s autobiography are contained in Wolf, “Erinnerungen an 
Johann I Bernoulli aus Basel,” in Grunert, ed., Archiv der Mathematik und Physik, 1849. 
This purposely incomplete quotation (to be continued) is from page 693. 

140 “Te seometria recondita et analysi indivisibilium atque infinitorum (Of abstruse ge- 
ometry and the analysis of indivisibles and infinites) 1686. Page references are to Opera 
Mathematica and Leibnizens Mathematische Schriften, in this order. 

141 The overbar covers the integral sign in the Opera but appears correctly in the Schriften. 
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reminiscent of Barrow’s.'4* Two years later he finall incorporated some 
transcendental functions in his version of the calculus. Specificall , he differ- 
entiated the general exponential function, following a suggestion by Johann 
Bernoulli, as follows [p. 314; 330; 324]:'49 


Let x” = y then vlogx = log y; now 


dx dy 
logx = | — and logy= | —- 
x ¥. 
Therefore, 
dx dy 
vf —=— 
x y 


which differentiating, becomes 


d d 
vy + dvlogx = ae aA 
x 


From this equation Leibniz should have concluded (and possibly did) that 
dy = d(x") = vx dx + x°dv log x, 


but, instead, his published equation contains a misprint. However, Leibniz’ 
paper contains the correct formula for the particular case in which v is a 
constant e, which is d(x°) = ex®—! dx [p. 314; 331; 325]. 

In the same paper Leibniz showed some insight into the nature of second- 
order differentials [p. 314; 331; 325]. Although without a reference to the de 
Beaune problem that he had solved in 1684, he considered the same differential 
equation, which, replacing w with x and x with y, is 


dx 


x 
dy a 


142 This is, essentially, the proof in the manuscript discussed at the end of Section 5.13. 
It appeared in “Supplementum geometriz dimensoriz, seu generalissima omnium tetrago- 
nismorum effectio per motum: similiterque multiplex constructio lineze ex data tangentium 
conditione” (Supplement on the geometry of measurement; or most generally on all quadra- 
tures effected by motion: similarly, multiple constructions of curves from data on the tangent 
condition), 1693. 

143 “Responsio ad nonnullas difficultates a Dn. Bernardo Niewentiit circa methodum dif- 
ferentialem seu infinitesimale motas” (Response to several difficultie of Dn. Bernardo 
Niewentiit about the differential method or motivated by infinitesimals) 1695. Page refer- 
ences are to the three works mentioned in the bibliography. 

1447 have modernized Leibniz’ notation. Here is his statement in the Opera: “Nempe sit 
x.” = y fie v.log.x = log.y; jam log.x = Vani & log.y = fidyzy. Ergo 
v. f,dx = f. dy: y: quam differentiando, fi vdx:x +dv. log.x =dy: y.” 
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Or dx = xdy:a. Therefore ddx = dxdy:a. Whence taking dy: a from 
the prior equation becomes xddx = dxdx: whence it is clear that x is to dx, 
as dx is to ddx. 


Today we would write Leibniz’ proportion as 


x dx 

dx ddx 
which shows in this example that dx is the mean proportional between x and 
daz,” 


5.16 THE AFTERMATH 


For reasons that may seem incomprehensible today, a dispute flare up between 
the partisans of Newton’s calculus and the partisans of Leibniz’ calculus. There 
were even allegations of plagiarism raised against Leibniz, for he had been to 
London, where he might have seen a copy of De analysi and stolen Newton’s 
ideas. Such allegations are pure nonsense, because Newton’s calculus is as 
similar to Leibniz’ calculus as an egg is to a chestnut, and the latter’s efforts to 
arrive at an understanding through notation are palpable. '4® However, Leibniz 
had purchased a copy of Barrow’s Lectiones geometrice while in London, and, 
doubtlessly, he had studied it and may have found inspiration from it to view 
quadratures and differentiation as inverse processes. There is also a similarity 
between Barrow’s geometric constructions and those of Leibniz, but also a 
very profound difference. For, whereas Barrow seemed to be content in that 
geometric milieu, Leibniz felt the need to leave it through the choice of a 
notation that would allow him to turn such procedures into an algorithmic 
calculus. Thus, his work represents a leap into the future from where Barrow 
stood.'47 

There is also a substantial difference in approach between Newton and 
Leibniz. Newton was basically a practical man, interested in differentiating 


145 For additional information on Leibniz’ development of the calculus consult Parmentier’s 
La naissance du calcul différentiel, 2000. This is a collection of 26 papers by Leibniz, 
originally published in the Acta Eruditorum, translated into French, with notes and an 
introduction. 

146 For a full account of the dispute see Hall, Philosophers at war: The quarrel between 
Newton and Leibniz, 1980. A popular account can be found in Bardi, The calculus wars. 
Newton, Leibniz, and the greatest mathematical clash of all time, 2006. 

147 Fora possible influenc of the work of Barrow, Newton, and Leibniz on each other see 
Feingold, “Newton, Leibniz, and Barrow too. An attempt at a reinterpretation,” 1993. 
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and integrating everything in sight, with applications to practical problems, 
and from the very beginning provided us with what we now call tables of 
integrals. Leibniz was basically interested in findin the working algorithms 
of the calculus, but did not neglect applications in his early work. We have 
seen how he provided the answer to the de Beaune problem via the solution of 
the firs ever differential equation, and this was a triumph for Leibniz’ calculus. 

And then the inevitable happened. Newton and Leibniz had given to the 
world a calculus that worked but whose foundations were shaky and insuffi 
ciently understood. Someone was bound to pick on this, and someone did. The 
firs was the Dutch physician and mathematician Bernard Nieuwentijt (1654— 
1718).48 He could not understand a number of points from either Barrow, 
or Newton, or Leibniz, from quantities that now are positive but then are zero 
to how infinitesimal could add up to something finite Leibniz printed a 
reply, '49 but the fact is that Leibniz, or anyone at that time, did not understand 
these fin points either. 

This meant that the problem could not go away until it was resolved. The 
leading critic in the British Isles was George Berkeley (1685—1753), one of the 
most prominent empiricist philosophers, a prolifi writer and critic of science, 
philosophy, mathematics, and politics. In 1734, the same year he became the 
Anglican bishop of Cloyne (County Cork, Ireland), he published The analyst, 
or a discourse addressed to an infidel mathematician, '*° his only work written 
in England. Berkeley found this infide guilty of [I] 


the misleading of unwary Persons in matters of the highest Concernment [re- 
ligion], and whereof your mathematical Knowledge can by no means qualify 
you to be a competent Judge. 


Then, it was only fair for Berkeley to [II] 


take the Liberty to inquire into the Object, Principles, and Method of Demon- 
stration admitted by the Mathematicians of the present Age, ... 


148 Considerationes circa analyseos ad quantitates infinite parvas applicate principia, & 
calculi differentialis usum in resolvendis problematibus geometricis, 1694; and Analysis 
infinitorum, seu curvilineorum proprietates ex polygonorum natura deducte, 1695. 

149 “Responsio ad nonnullas difficultate a Dn. Bernardo Niewentiit circa methodum dif- 
ferentialem seu infinitesimale motas.” 

150 Since this booklet is divided into fift sections, references will be given by section 
numbers in roman numerals. It is generally believed that the infide mathematician of the 
title was the Astronomer Royal Edmund Halley (see the Editor’s Introduction to vol. 4 of 
The works of George Berkeley Bishop of Cloyne, pp. 56—57). The relevant portions of The 
analyst are reproduced in Smith, A source book in mathematics, 1929, pp. 627-634. 
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In sections III to VIII Berkeley criticized the method of fluxion but just in 
verbal form, remarking on how difficul itis for the imagination to comprehend 
“those Increments of the fl wing Quantities in statu nascenti” [IV], or to 
“digest a second or third Fluxion ...” [VII]. But then he became more serious 
in criticizing some statements made by Newton after that of Lemma II in 
Book II of the Principia [Motte’s translation, p. 18]. Newton had designated 
the “momentary increments or decrements [of fl wing quantities] by the name 
of Moments;” and then stated that if the sides of a rectangle, A and B, are 
“increasing or decreasing by a continual flux with velocities a and b, then 
“the moment or mutation of the generated rectangle AB will be aB + Ba;” 
and then provided a very dubious proof of this fact [p. 19]. In Berkeley’s 
view, however, if you subtract AB from the product of A + a and B + b what 
remains is aB + Ba + ab, and this gave him occasion to turn Newton’s own 
words against him [IX]: 


And this holds universally be the Quantities a and b what they will, big or 
little, Finite or Infinitesimal Increments, Moments, or Velocities. Nor will 
it avail to say that ab is a Quantity exceedingly small: Since we are told that 
in rebus mathematicis errores quam minimi non sunt contemnendi.'*' 

As for Newton’s way of findin the fluxio of x”, reproduced on page 317, he 
had the following to say [XII]: 


For when it is said, let the Increments vanish, 192 i. e. let the Increments 


be nothing, or let there be no Increments, the former Supposition that the 
Increments were something, or that there were Increments, is destroyed, and 
yet the Consequence of that Supposition, i. e. an Expression got by virtue 
thereof, is retained. 


This is enough to get the idea of Berkeley’s objections, which he extended 
to Leibniz’ calculus too. His problem was not with the results, which he 
accepted as valid, but rather with the faulty logic used to arrive at them. He 
concluded section XXXV with some questions that are frequently quoted (it 
is irresistible): 


And what are these fluxions The Velocities of evanescent Increments? And 
what are these same evanescent Increments? They are neither finit Quan- 
tities nor Quantities infinitel small, nor yet nothing. May we not call them 
the Ghosts of departed Quantities? 


151 Whiteside’s translation of these Latin words can be found in the last sentence of the 
quotation from Newton’s De quadratura on page 317. 
152 They are called “augments” in Whiteside’s translation. 
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The calculus was in trouble and mathematicians knew it. They rose to 
its defense and provided counterarguments to Berkeley’s own, but the bishop 
held his own because he was on the right side. Explanations were neces- 
sary. They ware attempted in a big way by Colin Maclaurin (1698-1746), a 
professor firs at Aberdeen and later at Edinburgh. In response to Berkeley’s 
criticisms he wrote a lengthy treatise, seeking to justify the calculus by means 
of Euclidean and Archimedean geometry.'°? This impossible task may have 
been instrumental in turning mathematicians in the British Isles away from 
infinitesimals 

The continental mathematicians were not so squeamish, and they plunged 
right in. We are in debt to Jakob and Johann Bernoulli, who were the firs to 
make sense of Leibniz’ calculus. Johann had called this calculus an enigma, 
as se Cae above, but that was an incomplete quotation. Leinbiz’ calculus 
was 


an enigma rather than an explication; but that was enough for us, to master 
all its secrets in a few days, as witnessed by the quantity of papers that we 
published next on the subject of infinitesimals 


The result was that Leibniz and the Bernoulli brothers created most of the 
elementary differential calculus before the end of the seventeenth century. 
Johann Bernoulli was the firs to write a differential calculus textbook, the 
Lectiones de calculo differentialium, from lectures of 1691 and 1692, but it 
was not published until 1922. The reason is that Bernoulli had sold the rights to 
his work to Guillaume Frangois Antoine de |’ Hospital (1661-1701), marquis 
de Saint-Mesme, count of Autremont, Lord of Oucques, etc.1° L’Hospital, a 
mathematician himself, had been tutored by Bernoulli on the new differential 
calculus during the latter’s stay in Paris (1691-1692) and had acquired the 
exclusive rights to those teachings. In 1696 I’Hospital published the firs 
differential calculus book to appear in print: Analyse des inifiniment petits 


153 Treatise on fluxions, 1742. In Articles 858 to 861 of Book II (based on Articles 255 
and 261 of Book I), Maclaurin gave, proved, and illustrated by example what we now call 
the second derivative test for maxima and minima [Art. 858]: “When the firs fluxio of the 
ordinate vanishes, if at the same time its second fluxio is positive, the ordinate is then a 
minimum, but it is a maximum if its second fluxio is negative ...” A portion of the Treatise 
containing this topic is reproduced in Struik, A source book in mathematics, 1200-1800, 
pp. 338-341; reprinted in Calinger, Classics of Mathematics, pp. 476—478. 

154 Translated from Wolf, “Erinnerungen an Johann I Bernoulli aus Basel,” p. 693. 

155 But usually referred to as the Marquis de l’H6pital (using the modern French spelling), 
today as well as then, at least in writings that I have seen by Berkeley, Johann Bernoulli, 
and Leibniz. 
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DES 
INFINIMENT PETITS. 


PREMIERE Pantie. 
DU CALCUL DES DIFFERENCES. 


SECTION PREMIERE. 
Od Ton donne les regles de ce caloul. 


Dirrixirion I 


A 


Reproduced from the virtual exhibition El legado de las matemdticas: 
de Euclides a Newton, los genios a través de sus libros, Sevilla, 2000. 


pour lintelligence des lignes courbes, and acknowledged that he had made 
free use of the discoveries of Leibniz and the Bernoullis. Besides marking the 
firs appearance in print of a complete presentation of the new calculus, this 
book is distinguished by the use of the word intégrale to replace summatorium, 
as proposed by the Bernoulli brothers, '°® and a new theorem known and used 
today as |’H6pital’s rule. It appeared in Part I, Section IX, page 145. 

Mixing current terminology with l’Hospital’s notation, the function repre- 
sented by the curve AMD is the quotient of the function represented by ANB 
over that represented by COB (the ordinates AC and PO are positive, but they 
are drawn below the x-axis to avoid clutter). These last two functions vanish at 
the point x = a, represented by the letter B. L’Hospital stated his proposition 


156 This word was introduced by Jakob Bernoulli in “Analysis problematis antehac 
propositi,” 1690, p. 218 = Jacobi Bernoulli, Basileensis, Opera, Tomus primus, 1744, 
p. 423. 


SECTION 5.16 THE AFTERMATH 363 


as follows: 


PROPOSITION I. 


Problem. 


163. Let AMD be acurve (AP = x, PM = y, AB = a) such that the value 
of the ordinate y is expressed by a fraction, the numerator & denominator of 
which, each of them becomes zero when x = a, that is when the point P falls 
over [coincides with] the given point B. It is asked what must then be the 
value of the ordinate BD. 


Then |’Hospital gave a procedure for this task (which we would be very 
happy to revise today) using “an ordinate bd infinitel close to BD,” in which 
he revealed what to do: 


to fin the ratio of bg to bf. ... & therefore if we take the differential of 
the numerator, & divide it by that of the denominator, after having made 
x = a= Ab or AB, we'll have the value that is sought of the ordinate bd or 
BD. 


However, shortly after |’ Hospital’s death, Bernoulli felt free to claim this rule 
as his own discovery.'®’ This claim was validated much later when his corre- 


157 Tn “Perfectio regula: su edita in libro Gall. Analyse des infinimen petits art. 163. pro 


determinando valore fractionis, cujus Numerator & Denominator certo casu evanescunt,” 
1704, p. 376 = Johannis Bernoulli, Opera Omnia, 1742, vol. I, N°. LXXI, p. 401. 
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spondence was published, for in a letter of 22 July 1694, Bernoulli had indeed 
communicated it to l’Hospital along with the firs example in the Analyse.'°® 
Bernoulli (not Leibniz) had also developed the integral calculus, which he 
wrote under the title Lectiones mathematice de methodo integralium, aliisque 
conscripte in usum Ill. Marchionis Hospitali cum auctor Parisii agaret an- 
nis 1691 et 1692 (Mathematical lectures on the method of integrals, and on 
other topics written for the use of the Marquis de |’Hospital as the author 
gave them in Paris in the years 1691 and 1692).1°9 Here Bernoulli developed 
what would eventually be known as the method of substitution; and taught it 
correctly, which is unusual in today’s calculus textbooks. 

L’Hospital’s book became very popular and went through multiple edi- 
tions. It was not until the mid eighteenth century that the competition caught 
up. In Italy, Maria Gaetana Agnesi (1718-1799) published the Instituzioni 
analitiche ad uso della gioventu italiana in 1748, two years before she was 


INSTITUZIONI 
meno ALITLCHE 


ab Uso 
DELLA GIOVENTU' ITALIANA 


D™ MARIA GAETANA 


A GN ES TI 
MILANESE 
Delf Accademia delle Scienze di Bologna . 


d A REGLA-DUGAL CORTE. 
CON LICENZA DE’ SUPERIORI, 


158 Der Briefwechsel von Johann Bernoulli, vol. 1, 1955 pp. 235-236. 

159 They appeared firs in Johannis Bernoulli, Opera omnia. The integral calculus is 
in volume III, pp. 385-558. Some selections, containing the method of substitution, are 
translated into English in Struik, A source book in mathematics, 1200-1800, pp. 324-328. 
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appointed a professor at the University of Bologna. However, Agnesi, without 
refusing the appointment, never took the position. Book 2 is devoted to the 
Calcolo differenziale (including maxima and minima and the study of curves), 
Book 3 to the Calcolo integrale, and Book 4 to Equazioni differenziali. Here 
is an example of differentiation from page 465: 


a 
Let Vax +xx + at — x4, thatis ax +xx+ Vat — x4" , the difference 
[differential] will be 


x dx 


adx +2x dx 


3 
a4 — x44 


Jax +xx+ Vat —x4 


that is 


3 
adx +2xdx x at — x44 —x3 dx 


. : 
aaa q 
2x at — x44 Jax +xx4+ Vat — x4 


This textbook contains no original mathematics, but it is a lucid exposition of 
the subject, which is mainly taught by example. Agnesi became well known 
for this book, which was translated into French and English, but chose to 
remain at home for the rest of her life. Born into a wealthy family, which 
made her education possible, she died in poverty after spending her fortune 
on works of charity. 

And then Euler entered the picture, with the publication of Institutiones 
calculi differentialis in 1755. Euler shows us in this text that he could differ- 
entiate anything in sight, and we can too if only we follow his lead. Let the 
differentiation of the logarithm, on page 143, serve as an example of his style. 
Together with his four-volume /nstitvtionvm calcvli integralis, published from 
1768 to 1770, these books formed a comprehensive treatise of everything that 
was known up to then about the calculus, including many new results. 

However, none of the authors mentioned so far understood the founda- 
tions of the calculus. Nobody really knew what those differentials, those 
infinitesimals those nascent and ultimate ratios, really were. As an example 
of late-in-the-game misconceptions we shall pick on Euler. Such was the cal- 
iber of his genius, second to none in mathematics, that he could have accepted 
a little picking in good humor. 

In the preface to the 1755 book, Euler started by definin [pp. VIII; vii]: 160 


160 For this and the next quotation, this page reference is to the original firs and, after a 
semicolon, to Blanton’s translation. 
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The Differential Calculus, which is a method for determining the ratio of the 
vanishing increments, that any functions take on when the variable quantity, 
of which they are functions, is given a vanishing increment ... 


Note the use of the word function in our present sense. It is the firs time that it 
was used that way by any of the authors whose work has been discussed so far. 
The given definitio is correct but leaves much to be explained, in particular 
the vanishing increments. This is what Euler had to say: 


Accordingly they are called differentials, which seeing that they are without 
quantity, they are also said to be infinitel small; therefore by their nature 
they are interpreted, as nothing at all or they are to be thought of as equal to 
nothing. 


Euler was very insistent throughout his book that differentials are equal to 
zero, which doesn’t sit well with taking their quotients. But he explained this, 
already in the text proper, in Chapter III: On the infinite and the infinitely small 
[Art. 85], by stating that 


when zero is multiplied by any number it gives zero, so that n - 0 = 0, and 
thus n: 1 = 0:0. Hence it is clear that any two zeros can be in a geometric 
ratio [quotient], although, from an arithmetical point of view, that ratio is 
always of equals. 


Then he added [Art. 86]: 


If therefore, we accept the notation used in the Analysis of the infinite dx 
denotes an infinitel small quantity, so that dx = 0, as well as adx = 0, 
where a denotes any finit quantity. In spite of this, the geometric ratio 
adx: dx is finite namely asa: 1;... In asimilar manner, if we must deal 
with separate infinitel small quantities dx & dy, although these are both 
= 0, still their ratio is not apparent. 


However, the quotient dy: dx is not considered in the rest of Chapter III. It 
appears again in Chapter IV, but the next quotation is so out of context that it 
is difficul to understand the appearance of the function P that is used below 
[Art. 120]: 


Whatever function y is of x, its differential dy is expressed by a certain 
function of x, for which we write P, multiplied by the differential of x, that 
is by dx. Thus although the differentials of x & y are in fact infinitel small, 
and therefore equal to zero; still there is a finit ratio between them: namely 
dy:dx = P:1. 
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This is enough to realize that Euler could not bring home the bacon with all 
these gyrations. From our point of view, he was trying to explain what he did 
not fully understand. Without realizing it, he was trying to avoid something 
that is a must: the concept of limit. 


CONVERGENCE 


6.1 TO THE LIMIT 


In our days the definitio of limit is well known and used appropriately. In the 
past even a genius of the firs water like Newton could only intuit the concept. 
And that in itself was a giant leap, even though such a leap is insufficien to 
bridge the infinit gap toward the ultimate ratio that he sought. The need for a 
definitio was soon felt, and Jean le Rond d’ Alembert, of Paris, tried to provide 
it but without success. As coeditor of the famous Encyclopédie he contributed 
the Discours préliminaire and numerous articles, including the following as- 
sessment of Newton’s view of the calculus in the article DIFFERENTIEL of 
volume 4 (CON — DIZ) [p. 985]: 


He never regarded the differential calculus as the calculus of infinitel small 
quantities, but as the method of firs & last ratios, that is the method of findin 
the limits of the ratios. 


Hence, it is imperative to defin limit, and d’Alembert did so in volume 9 of 
the Encyclopédie (JU - MAM) [p. 542]: 


LIMITE, s. f. (Mathémat.) It is said that a magnitude is the limit of 
another magnitude, when the second may approach the firs closer than a given 
magnitude, as small as it can be imagined, and yet without the approaching 
magnitude, ever being able to surpass the magnitude that it approaches; ... 


Then to make sure that his position on the essence of the calculus was prop- 
erly explained and understood, he took on the metaphysics of the differential 
calculus in DIFFERENTIEL and expressed himself as follows [p. 987]: 


E.A. Gonzalez-Velasco, Journey through Mathematics: Creative Episodes in Its History, 368 
DOI 10.1007/978-0-387-92154-9_6, © Springer Science+Business Media, LLC 2011 
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The differential calculus is not about infinitel small quantities, as it is still 
ordinarily said; it is concerned only with the limits of finit quantities. ... 
We shall not say, along with many mathematicians, that a quantity is infinitel 
small, not before it vanishes, not after it vanishes, but at the very same instant 
at which it vanishes;! ... We shall say that there are no infinitel small 
quantities in the differential calculus. 


We get the message, but from our present point of view, d’ Alembert’s definitio 
imposes the excessive requirement that the approaching magnitude should 
not surpass the target magnitude. Actually, it is not a proper mathematical 
definitio but just a play with words. What does it mean to “‘approach”? 
Is time involved? How does one measure this approach? Mathematicians 
were not used to absolute precision at that time, nor did they appreciate its 
fruitfulness outside the realms of synthetic geometry or number theory. 

But then the need for a deeper understanding of the idea of convergence 
was made evident by the attempts to figur out the solutions of new applied 
problems involving infinit series, and it was in this context that it was define 
first 


6.2 THE VIBRATING STRING MAKES WAVES 


When a taut piece of string, tied down at both ends, is either plucked or hit, it 
emits audible vibrations. Their frequency, that is, the number of vibrations per 
second, was firs determined by Brook Taylor in his 1713 paper “De motu nervi 
tensi,” but he believed that the shape of the string at any given instant of its 
motion was a sine wave. There is, however, more than one audible frequency 
of vibration when such a string is in motion. The one found by Taylor is called 
fundamental because it is the lowest frequency, and the other vibrations that 
make up the motion of the string are called harmonics. Intuitively, then, we 
expect the shape of the string to be a sum of sine waves; perhaps an infinit 
series of sine waves. 

D’Alembert was the firs to determine the shape of a vibrating string at 
a given time, and his success was based on the fact that in 1747 he derived 
a partial differential equation that accurately describes the phenomenon in 
question and then solved it.2 Neither d’ Alembert’s notation nor his derivations 


1 This is almost exactly what Newton had stated in the Principia, as quoted on page 311. 


2 “Recherches sur la courbe que forme une corde tendué mise en vibration,” and “Suite 
des recherches,” both of 1747. 
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JEAN LE ROND D’ ALEMBERT 


Portrait by Maurice-Quentin de La Tour. 
Engraving by William Hopwood. 
Lithograph from the author’s personal collection. 


can be considered appealing to modern tastes, so we shall limit ourselves to 
stating his equation and the solution. He considered a string, initially stretched 
between two fi ed points along the axis of abscissas. Then, if the vertical 
displacement of a point of the string at time ¢ at a point of abscissa s is denoted 
by y = g(t, s) [p. 215], and if wa and 6 denote what we now call the second 
partial derivatives of y with respect to ¢ and s, respectively, the differential 
equation is [p. 216] 
2aml 

Q2 
The fraction on the right contains several physical constants that are of no 
interest to us. Today we would write this equation, which is now called the 
wave equation, as 


az 
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and simply say that b is a positive constant. The unit of time can be chosen so 
that b = 1, and doing so, d’Alembert then found the general solution of this 
equation to be [p. 217] 


y=Wt+s)+Tt—s), 


where W and I are “as yet unknown functions” that must be determined from 
the facts that the string is tied down at its endpoints, that it is given no initial 
velocity, and that its initial displacement is known.? 

D’Alembert found these functions [pp. 228—230], but the problem with 
his solution is that it shows none of the vibrations in time that we expected, the 
fundamental and the harmonics. It was Euler who, two years later, in his paper 
“De vibratione chordarum exercitatio,” brought up the fact that the motion of 
the string is periodic in time and made up of many individual vibrations. He 
started by rederiving the differential equation and its general solution by his 
own method, and then, if y = f(x) represents the initial displacement of the 
string, he obtained the following equation for its displacement at a point of 
abscissa x (his choice of variable instead of d’Alembert’s s) and at time t > 0 


[Art. XXII]: 
y= a h(x ttvb) +3 Fe — 1b)" 
As an example, if the initial displacement of a string of length a is given by 
[Art. XXX]] 
. ‘WX . 2nx . 3x 5 
y=asin + 6 sin + y sin + &C, 
a a a 


and if we use this right-hand side as f(x) in the preceding general solution, 
then, simplifying the result and writing v for t./b, the displacement of the 
string at a point x and at time ¢ > 0 is given by 

UX Tv 20x 2nv 3x 3v 


y = asin — cos — + sin cos + y sin cos + &c. 
a a a a a a 


3 An English translation of the relevant parts of d’Alembert’s paper containing these 
derivations can be seen in Struik, A source book in mathematics, 1200-1800, 1969, pp. 352— 
357. 

4 Euler wrote f:(x + tb) [p. 521] instead of f(x + tb). Note, in any event, that 
x —tVb < 0 for ¢ large, so that it is necessary to extend the function f to the negative 
x-axis. Furthermore, if we assume that the left endpoint of the string is at x = 0, the fact 
that this point is tied down implies that f(tWb) + f(—tWb) = 0 for all t > 0, which 
implies that the extension of f must be odd. Then, if the length of the string is a, this 
extension is an odd periodic function of period 2a. 

5 As before, I have suppressed the period in Euler’s abbreviation sin. for the sine. 
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Since v is a constant times t, we now have vibrations, but in order to produce 
them Euler had to choose what looks like a highly artificia initial displacement 
as aninfinit sum of sines. The obvious question is: how representative is this 
of an arbitrary generating curve (courbe génératrice), as d’Alembert called 
the initial displacement [p. 220]? 


With the preceding statements, Euler had opened a big can of worms, one 
that we cannot ignore, for it is at the very center of mathematical analysis. 
Here is how the situation looks before careful examination: if a tight string is 
given an arbitrary initial displacement and then let go, there is little doubt that 
there will be vibrations. Is this reason enough to conjecture that an arbitrary 
function can be represented by the sum of an infinit series of sines? 

The firs to pronounce himself on this subject was Daniel Bernoulli (1700 — 
1782), son of Johann. Upon reading d’Alembert’s and Euler’s papers, he 


DANIEL BERNOULLI CIRCA 1750 


Portrait by Johann Niklaus Grooth. 
From Die Werke von Daniel Bernoulli, Band 2: Analysis Wahrscheinlichkeitsrechnung. 
in Die Gesammelten Werke der Mathematiker und Physiker der Familie Bernoulli, 
ed. by David Speiser, Birkhauser -Verlag, Basel, 1982. 


SECTION 6.3 FOURIER PUTS ON THE HEAT 373 


published his own ideas on the subject in 1753. Elaborating on a previously 
published belief that the shape of the string at a given instant is the superpo- 
sition of individual vibrations, now he stated [Art. XII] that its displacement 
at any given time” 


shall have in the general case the following equation for the same abscissa x: 


. TEX _ 20x . 3x . 50x 
y=asin + B sin + y sin + 6 sin + etc., 
a a a a 


in which the quantities a, 6, y, 4, etc. are positive [affirmatives] or negative 
quantities. 


Bernoulli, who did not write the dependence on time, based his solu- 
tion on physical considerations alone and provided no mathematical reasons 
whatsoever to back it up. Euler pounced on it immediately, the very same 
year, refusing to accept that it could represent the general case [Art. VII].® 
D’Alembert was of the same opinion as Euler on this point: an arbitrary curve 
cannot be represented by the sum of an infinit series of sines, an opinion that 
he published in the article FONDAMENTAL in volume 7 of the Encyclopédie 
(FOA-GYT). But Bernoulli did not surrender his position, for, he said 9 


the last resulting curve [the one represented by the preceding infinit series] 
will include an infinitud of arbitrary quantities [the coefficients] which can 
be used to make the fina curve pass through as many points given in position 
as one wishes, & to identify through them this curve with the proposed curve, 
with any degree of precision that one wishes. 


6.3 FOURIER PUTS ON THE HEAT 


Not too long after these times, great events took place in France that, indirectly, 
would come to settle the argument after the death of its principals. On July 12, 
1789, the people of Paris—then a walled, dirty, unhealthy town of 550,000 
with dusty, narrow streets—rioted after the dismissal of the prime minister 
Necker. What started as a riot, fueled by discontent about food shortages 


8 “Réfl xions et éclaircissements sur les nouvelles vibrations des cordes exposées dans 
les mémoires de 1’Académie de 1747 et 1748,” 1753. 

7 From the translation in Struik, A source book in mathematics, 1200-1800, p. 364. 

8 “Remarques sur les mémoires précédens de M. Bernoulli,” 1753. 

2On page 165 of “Lettre de Monsieur Daniel Bernoulli, ... 4 M. Clairaut ... sur la 
vibrations des cordes tendues,” 1758. 
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and low incomes, soon turned into a full-fledge revolution. A revolutionary 
council was formed that night, and then the Bastille was attacked and destroyed 
on July 14. 

The monarchy was abolished in 1792, and the French republic proclaimed 
on September 22. A new revolutionary calendar, beginning with 1 Vendémi- 
aire, Year I, had been established to mark time from that day; and on 7 Ger- 
minal Year VI—March 27, 1798—a young professor at the newly founded 
Ecole Polytechnique, Jean Joseph Fourier (1768—1830), was made a member 


JOSEPH FOURIER 


Portrait by Julien Léopold Boilly. 
Engraving by Amédée Félix Barthélemy Geille. 
From Cuvres de Fourier, Tome Second, 1890. 


of the Commission of Arts and Sciences of General Bonaparte’s expedition to 
Egypt. 

Upon his return to France in November 1801, after a British victory in 
Egypt, Fourier resumed his post at the Ecole Polytechnique but only briefl . 
In February 1802, Bonaparte appointed him Préfet of the Department of Isére 
in the French Alps, and it was here, in the city of Grenoble, that Fourier returned 
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to his physical and mathematical research. But Fourier’s stay in Egypt had left 
a permanent mark on his health, which was to influence perhaps, the direction 
of his research. He claimed to have contracted chronic rheumatic pains during 
the siege of Alexandria and that the change of climate from that of Egypt to 
that of the Alps was too distressful for him. He lived in overheated rooms, 
wore an excessive amount of clothing even in the heat of summer, and it was 
on the subject of heat that he concentrated his research efforts. 

Fourier had a complete manuscript in 1807, in which he found the general 
equation for the diffusion of heat in a three-dimensional solid."° If v is the 
temperature, this equation is [Art. 28, p. 126] 


dvu_ K - dv = 41 
dt  CD|dx? 


where “v is a function of the three coordinates x, y, z and of time fr,” and K, 
C, and D are physical constants with which we need not be concerned. It is 
now called the heat equation. Then he posed the following problem [Art. 32, 
p. 134]: 


We assume that a lamina or a rectangular surface, of infinit length, is heated 
at its end 1, and is kept at a constant temperature at all points of this edge, and 
that each of the other edges 0 and 0 12 are also kept at a constant temperature 
0 at all their points. It is a question of determining what the stationary 
temperatures at each point of the lamina must be. 


To be precise, a lamina of negligible thickness in the z direction is located 
in the plane strip 0 < x < oo, —-1 < y < 1. The edge at x = 0 is kept at 
a constant temperature | while the edges at y = +1 are kept at a constant 
temperature 0 for all t > 0. It is desired to fin the temperature at each point 
of the lamina after a sufficien amount of time has passed. 

It is assumed that after some time the temperatures have reached a steady 
state and are no longer dependent on time. Thus, the left-hand side of the heat 


10 Mémoire sur la propagation de la chaleur = Grattan-Guinness and Ravetz, Joseph 
Fourier 1768-1830, 1972, pp. 30—440. Page references are to this edition. 

11 All these are partial derivatives, but there was no special notation for them in Fourier’s 
time. The copyist who wrote this passage (the page shown in the photograph is mostly in 
the hand of a professional copyist but finishe in Fourier’s own hand), appears to have used 
partial derivatives, but such is not the case. It is simply that all his d’s look like 0’s. 

12 Fourier had a rather confusing way to refer to the edges of the lamina. Later in his 
paper it became clear what those edges are, and it is explained below this quotation. 
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A portion of page 60 of Fourier’s 1807 manuscript. 
Reproduced from Grattan-Guinness and Ravetz, Joseph Fourier 1768-1830. 


equation vanishes. Also, the negligible thickness of the lamina means that 
there is no appreciable variation in temperature in the z direction, and then 
the last derivative also vanishes. Since the letter z can then be reused, Fourier 
switched his notation for the temperature from v to z, and all this reduces the 
heat equation to [p. 135] 

ce us 0 

dx? uy dyz 


Now this equation must be solved, and Fourier resorted to a method that 
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was rare in his time, a method previously used by d’Alembert in 1750 for the 
vibrating string, '? but after a few lines [p. 356] d’ Alembert made nothing out 
of it. It was Fourier who put this method on the map, and as used by him it 
consists in assuming that there is a solution of the form z = ¢(x)W(y), where 
@ and y are unknown functions to be determined [Art. 33, p. 137]. Taking 
this proposed solution to the equation for z, he obtained 


d(x) Wy) + o()W"(y) = 0, 


and then, dividing by the product 6’ y” (implicitly assuming that this can be 
done), 4 


ae 2, vy) =( 

p(x) ow") 
Essentially, this means that a function of x is equal to a function of y, and the 
only way this can be is for each of them to be constant. That is, 


P(x) = vy) = 
og” (x) w"'(y) 
where A is a constant. This simplifie the problem by transforming a partial 


differential equation in two variables into two ordinary differential equations, 
each in one variable. Then Fourier made the following statement: 


and 


’ 


One sees by this that we can take for # (x) any quantity of this form e”” and for 
w(y) the quantity cos(ny). Therefore we shall assume that z = ae”"*-cos(ny), 
and substituting in the proposed [simplifie heat equation] we shall have the 


condition equation m* = n?. 


This means that we have found solutions of our particular heat equation of the 
form z = ae” cos(ny) and z = ae~"* cos(ny). Fourier’s carelessness in di- 
viding by w’’(y), which is shown now to have zeros, becomes inconsequential 
because it is easily verifie by differentiation that, for each n, any of the two 
functions stated here is indeed a solution of the given equation. 


13 “A ddition au mémoire sur la courbe que forme une corde tendué, mise en vibration,” 
1750, pub. 1752. 

14 This manuscript is in the hand of a professional copyist, who did not use parentheses 
around the variable of a function. For whatever reason, the rest of page 60 was finishe in 
Fourier’s own hand, and he used parentheses. I have used them throughout for consistency. 
However, I write cos and sin instead of Fourier’s cos. and sin. 
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If we take n > 0, which does not make any difference in the cosine factor, 
the firs of the two solutions above must be rejected because it grows without 
bound as x grows. This is impossible with the edges of the lamina kept at 
temperatures | and 0. Fourier continued as follows [p. 138]: 


Therefore the preceding solution is reduced to z = ae~"* cos ny, n being any 
positive number and a an undetermined constant. Now the general solution 
will be formed by writing 


Nyx n NZX 


Z=ayje cosnjy +a.e "2* cosnzy + a3e cosn3y +--+ &c. 

The last assertion is a leap to say the least. It is easy to see by direct substitution 
that the sum of a finit numbers of solutions of the heat equation is also a 
solution. But is it clear that this holds for an infinit number of terms? If we 
humor Fourier and accept his claim, the constants n;,n2,73,... can then be 
determined from the fact that z = 0 for y = +1. Then these constants must 
be odd integral multiples of 1/2, so that the cosine factors vanish. Then, the 


fact that the temperature at x = 0 is | requires that [Art. 34, p. 139] 
1 = a, cos imy + a2 cos amy + a3 cos srry + a4 cos try +--+ &e. 


Fourier concluded with the statement [p. 140]: “It remains to fin the values 
that we must give the coefficient a... az... a3... to satisfy the equation.” 
He did fin them, but their values are not relevant to us at this moment. Later 
[Art. 50, p. 194] he stated that this was just a particular case 


of a more general problem that consists in findin out whether an entirely 
arbitrary function can always be expanded in a series of terms that contain 
sines and cosines of multiple arcs. 


Fourier began [Art. 51, p. 194] with the particular case in which the expansion 
has only sines, 


o(x) =asinx + bsin2x +csin3s +dsin4x + &c. 


He found the coefficients firs by some tedious and laborious method [Art. 61, 
pp. 211—213], and then by a much simpler one [Art. 63, p. 216] that is included 
in today’s texts on the subject. 

Fourier’s belief that an arbitrary function can be represented by an infinit 
trigonometric series was rejected by Lagrange much as Bernoulli’s had been 
rejected by Euler. The main obstacle in accepting that an infinit series could 
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add up to an arbitrary function was the fact that mathematicians were accus- 
tomed to thinking of functions as being given by analytical expressions such 
as polynomials, roots, logarithms, and so on. It is clear that such aperiodic 
functions cannot be represented by an infinit series of periodic functions over 
the whole real line, but somehow they failed to realize that they could be equal 
over a finit interval. 

Fourier gave numerous examples of the expansions of functions in series 
of sines or of cosines [Arts. 64—74, pp. 218—236], and, in particular, he gave 
three different expansions for ix. Here is one [Art. 65, p. 221]: 


l 


x =sinx —1sin2x+ + sin 3x — z sin 4x + i sin 5x —-++ &e. 


2 


>) Td 


Fourier explained as follows: 


The true sense in which one must take this result consists in the fact that the 
sine-like line [curve] that has the equation: 


I 


3 sin3x +---, 


y=sinx — 5 sin 2x + 
and the straight line whose equation is y = $x, coincide in that part of their 
course that is placed above the axis, from 0 to x. Beyond this point, the firs 
line departs from the second and moves down to cut the axis perpendicularly. 


Thus the equation 
y sin x 3 sin 2x + 3 sin 3x Zq sin 4x + 


belongs effectively to the line 017101371 ..., which is made up of inclined 
lines and vertical lines. 


Eventually, in the study of heat diffusion in an annulus, Fourier needed a more 
general expansion in terms of both sines and cosines [Art. 79, pp. 259-260] 
and was able to fin the corresponding coefficients 
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The propagation of heat was the subject chosen by the Institut de France 
for a prize essay in 1811, and on January 6, 1812, the prize was awarded 
to Fourier’s Théorie du mouvement de la chaleur dans les corps solides, an 
expanded version of his 1807 memoir, which was finall printed in 1824 and 
1826. This work was incorporated into Fourier’s monumental work Théorie 
analytique de la chaleur of 1822.'® 


6.4 THE CONVERGENCE OF SERIES 


The work on applied topics described in the preceding two sections presented 
mathematicians with the task of seriously studying the convergence of series. 

This was firs accomplished by José Anastacio da Cunha (1744-1787), of 
the University of Coimbra and the College of Saint Luke in Portugal. He was 
well acquainted with the works of d’Alembert and Euler, and his Principios 
mathematicos of 1790 was the firs book containing the principles of analysis 
written in the modern fashion, using the now classic sequence Definition 
Theorem—Proof—Corollary. Conciseness and rigor were da Cunha’s trade- 
marks. 


15 Tt is in this book that we fin frequent use of two notations that gained favor rapidly. 
The firs is the use of the symbol © for infinit series. It had been originally introduced by 
Euler, stating “summan indicabimus signum &” in his Jnstitutiones calculi differentialis, 
1755, Chapter 1, Art. 26. Fourier initiated the systematic use of & in the Théorie analytique, 
in Art. 235, with the statement: “Donc, en désignant par © cos. ix — a la somme de la série 
précédente” (Thus, denoting by & cos. ix — a the sum of the preceding series). Note the 
use of an overline rather than parentheses. Soon thereafter, he started writing the sum limits 
above and below the letter sigma. By the time he wrote the table of contents he was using 
the almost modern notation, as in 


1 i=—0o 
aux = )> cos. ix i dacos.iaw. 
i=00 


Notice the definit integral on the right-hand side. This way of denoting the limits of integra- 
tion was also introduced by Fourier. In Art. 222 we encounter “L’intégrale i x sin. ix dx” 


and in Art. 231 he stated: “Nous désignons en général par le signe p Vintégrale qui com- 
mence lorsque la variable équivaut a a, et qui est complete lorsque la variable équivaut a b” 
(In general we denote by i : the integral that starts when the variable equals a and ends 
when the variable equals b); however, Fourier had already used the definit integral sign in 
his 1811 prize essay, but it was firs published in the Théorie analytique. 

The work of Fourier presented in this section is essentially contained in the Théorie 
analytique in Arts. 166—169 for separation of variables and the cosine series, Art. 190 for 
the special solution for the lamina, Arts. 219-223 for the sine series, and Arts. 240—241 for 
the full series. See the bibliography for English translations of these articles. 
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Book VIIII starts with the definitio of convergent series [p. 106; 117]:"6 


DEFINITION I. 


Mathematicians call a convergent series that whose terms are similarly deter- 
mined, each one by the number of the preceding terms, in a way that the series 
can always be continued, and finally it comes to be indifferent to continue it or 
not, because however many terms one wants to join to those already written or 
indicated their sum can be neglected without noticeable error: and the latter 
are indicated by writing &c. after the first two, or three, or however many one 
wants: it is therefore necessary that the written terms show how the series 
could be continued, or that this be known in some other manner. 


Note the modesty shown by this statement. Mathematicians would even- 
tually call such a series convergent, but da Cunha was the firs to give this def- 
inition, more than 30 years before anyone else rediscovered it. However, the 
clarity of the statement (from which we have omitted an impossible comma) 


16 Page references are to the Portuguese edition firs and then to the French translation. 
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is impaired by the fact that it begins and ends by emphasizing at length that the 
law of continuation of the series must be clearly stated or possible to deduce 
from the observation of a few terms. For us that goes without saying. The 
heart of the definitio is in the following words: 


finall it comes to be indifferent to continue it or not, because however many 
terms one wants to join to those already written or indicated their sum can 
be neglected without noticeable error. 


What this means, using current notation, is that a series 
uy tug tuyt---tu,t-::: 
converges if for some n we can neglect without noticeable error the sum 


Un+1 + Ung2 + Un43 +++ + Untm 


for every positive integer m. This is, except for some precision added in 
its present-day statement, the modern definitio in mathematical analysis, 
although not the one included in calculus textbooks. 

Da Cunha then illustrated the use of his definitio in the proof of his firs 
proposition [pp. 106-107; 117-119]: 


PROPOSITION I. 


A series of continued proportions is convergent, if the first is larger than the 
second. 


He explained that this means a series of the form 


B BB B B B 
A+B+B—+B x—xX 
A A A 


&e., 
At aos Tie 


in which A > B. He also assumed, but did not state, that B > 0.17 Then he 
began the proof by writing c = B/A, so that his series becomes 


A(l_+c+cc+cce + cece + &c.), 


17 Tn spite of his modernity in other respects, Da Cunha seems to be working with positive 
numbers only. He made similar assumptions in the corollary and the definition quoted in 
this section. Proposition I is valid if B < 0 provided that A > |B]. 
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and chose a number O < A as small as desired, “so that it can be neglected 
without noticeable error.” 

Note that the assumptions on A and B imply that 0 < c < 1. Then the 
terms of the series 


1 1 1 1 
Sea 2 + + &c., 


c cc ccc CCCC 


which Da Cunha wrote next, are well define and grow without bound. There- 
fore, there is a term, which he denoted by d, such that 
A 
O-—cO 


d> 


(he used an argument based on an axiom from Book III [p. 21; 24], but to us this 
is clear). The term that corresponds to d in the series 1+c+-cc+ccc+cccc+&c. 
is 1/d, so that the continuation of this series from this term on is 


Di pes ac OOSs te 
Par ar ar eae 
and this sum can never equal 
d—dc 


no matter how far it is continued. Indeed, if it is continued to the term c’"/d, 
where m is a positive integer, then we have 


1 Cc cc ce" 
i eaced aa] d—dc)=1—c"t! <1. 
(; rey ml d 2 po = 


But the choice of d gives dO — cdO > A, and then 


O 1 
— > . 
A d—dc 
Thus, 
O 1 ¢.  «€¢ 
&e. 
a a ee 
(the sum on the right is finit in spite of the symbol &c.), and 
O A rs cA 4 ccA 4+& 
gE gh age ee 


But the right-hand side is the continuation of the original series. Since O is 
arbitrarily small, it follows from Definitio I that the given series is convergent. 
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Then da Cunha turned to the definin series for e“ and stated the following 
corollary [pp. 107-108; 119]: 


Coro_. I. 


Let a represent any [positive] number. I say that the series 


jist: aa ‘i ada Ae adaada adaddaada & 
ONO 2x3 | 2x3xK4 | 2x3xK4K5 | 


is convergent. 
For the proof he let b be an integer larger than a, denoted the bth term in 
this series by c, and wrote the continuation from c: 
ac + aac aadac + & 
c 
b+1 (+)64+2) 6+)0D64+2)6+3) 


Then he observed that each term in this continuation does not exceed the 
corresponding term of the series 


ce 


i ac ; aac ; adac 
c T I 
bal bs DOr Ty. te De Do 1) 


which converges by Proposition I. Therefore, the preceding two series also 
converge by comparison. 

Then, on this basis, da Cunha was able to defin the general exponential 
function [pp. 108-109; 120]. 


oe 


DEFINITION II. 


Let a [> 0] and b represent any two numbers, and let c be the number that 


makes 
cc ccce 


l+e+ S44 + &e. = 
ee D9 943.” Teagan 


the expression a? will mean a number 


Sere Bee, CUR OUUES ti 
~ oro O63 ' xanga 


and it will be called the number a? the power of a indicated by the exponent b: 


Using Euler’s number e, which da Cunha did not use in this chapter, the 
sums of these two series are e° and e?° = (e°)?, and it is clear that a? = e?°. 
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The remarkable thing is that while Euler was originally content with such 
vagueness as “a¥7 will be a certain value comprised between the limits a? 
and a>,” da Cunha was able to give a correct, rigorous, and general definition 
From it he developed, in the rest of Book VIIII, a correct and rigorous theory 
of powers and logarithms. As an example we quote the following property of 


the exponential function [p. 113; 123]: 


V. Leta, b, c represent any three numbers: it will be (a’)° =a", 
With arbitrary powers on a solid foundation, da Cunha went on to defin 
logarithms [p. 119; 128]: 


DEFINITIONS. 


Il. Considering all numbers as powers of a given number, this is called 
base; and the exponents are called logarithms of the numbers to which they 
belong. 

Ill. The logarithms are called hyperbolic, and also natural, when the base 


1 1 1 
1+14+=+ + &c. 
2 2x3 2x3x4 


Then, using Euler’s / to denote any logarithm, he concluded this chapter 
as follows [p. 120; 128]: 


VIL. la” =nla. 


Let b represent the base, it will be a = b'@ [9. def. 3], and then a” = b”@ 
[9. 5], and then Ja” = nla [9. def. 3]. 


When Fourier presented his prize essay to the Jnstitut in 1811, he was not 
aware of da Cunha’s work on series. He could not have been because the 
French translation of the Principios that appeared in Bordeaux in that year, 
although largely an excellent translation, erroneously misstated the definitio 
by pronouncing a series convergent 


provided that on arriving at a given number of firs terms, it is possible to 
neglect the others without considerable error. In sucha case, and after having 
written enough terms to indicate the law of continuation, we denote the sum 
of those that are neglected, by an etc. after the sequence of firs terms. 
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What this essentially says is that a series is convergent if it is still convergent 
after removing a finit number of firs terms, and this is reinforced by the 
explanation of the meaning of the abbreviation etc. Thus, this definitio is 
useless. It could not have been used by Fourier, and da Cunha’s work was 
largely ignored by the mathematical establishment. '® 

One of the firs definition of convergence in France was given by Fourier 
himself in his prize paper of 1811, later incorporated into his book. Near the 
end of Article 228, he made the following statement about how to establish 
the convergence of a series: 


It is necessary that the values at which we arrive, on increasing continually 
the number of terms, should approach more and more a fi ed limit, and 
should differ from it only by a quantity which becomes less than any given 
magnitude: this limit is the value of the series. 


This definitio is along the lines of the one usually included in calculus texts, 
but it has two main disadvantages over da Cuhna’s. It is necessary to guess 
the limit before attempting to show that it is indeed the limit, and it must be 
possible to sum an arbitrary number of terms of the series in order to compare 
this sum with the guessed limit. 

Working in relative isolation in Bohemia, Bernardus Placidus Johann Nepo- 
muk Bolzano (1781-1848), a priest and professor of theology at Karl Ferdi- 
nand University in Prague (suspended in 1819 due to his liberal views on social 
issues and pacifis beliefs, put under house arrest, and forbidden to publish) 
discovered the better definitio of convergence independently of da Cunha. 
It appeared at the start of Article 7 in a pamphlet proving what we now call 


the intermediate value theorem.'® In Article 6 he established the notation by 
n n+l n+2 n+r 
letting Fx, F x,F x,...,F x be “the values of the sums of the firs n,n+1, 


n+2,...,n+,r terms ofa series” of given functions of x. However, for the 


18 Although the sharp eye of Gauss took notice, and in a letter to Bessel of 21 November 
1811, he stated: “Likewise, all the Paradoxes that some Mathematicians have found about 
Logarithms just disappear, when one does not start with the usual Definitio base!°8*"- = 
Number, which is really satisfie only, when the Exponent is a whole Number, and does 
not make any Sense, when the Exponent is imaginary—but each Quantity that substituted 
for x in the Series 1 +x + 5xXx + ex? + etc.--- gives the Value A, is called the Logarithm 
of A; as I see with Pleasure, the Portuguese Acunha had actually selected this Definition 
(Definition II to IMT in Book VIHI of the Principios [p. 119; 128]), in Briefwechsel zwischen 
Gauss und Bessel, 1880, p. 153 = Werke, 101, p. 364. 

19 “Rein analytischer Beweis des Lehrsatzes dass zwischen je zwey Werthen, die ein 
entgegengesetztes Resultat gewahren, wenigstens eine reele Wurzel der Gleichung liege,” 
1817. The theorem quoted below is on pp. 266—267 of Russ’ English translation of 2004. 
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BERNARD BOLZANO IN 1839 
Portrait by Aron Pulzer. 


purposes of this article and the next, the value of x is supposed to be fi ed, 
so that we are actually dealing with a series of numbers rather than functions. 
Then Bolzano stated his result as follows: 


Theorem. If a series [sequence] of quantities 


1 2 3 n n+r 
Fx, Fx, Fx,..., Fx,..., Fx,..., 


n 
has the property that the difference between its nth term Fx and every later 
n+r . . 
one Fx, however far this latter term may be from the former, remains smaller 


than any given quantity ifn has been taken large enough, then there is always 
acertain constant quantity, and indeed only one, which the terms of the series 
approach and to which they can come as near as we please, if the series is 
continued far enough. 


If the nth term of Bolzano’s sequence is the nth partial sum 


n 
Fx =u, +u2+u3+++:+Un 
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of an infinit series, then 


n+r n 


Fx — FX = Uns + Uny2 + Ung3 +++ + Unter 


for every positive integer r, which shows that Bolzano’s result is equivalent 
to da Cunha’s definition 

The rigorous approach adopted by Bolzano in his paper might have helped 
solidify the foundations of the calculus if his work had reached the mathemat- 
ical centers of the time. As it is, it went largely ignored. Thus it was left to 
Augustin-Louis Cauchy (1789-1857), who had just been appointed professor 


AUGUSTIN-LOUIS CAUCHY IN 1821 


Portrait by Julien Léopold Boilly. 
From Smith, Portraits of Eminent Mathematicians, Il, 1938. 


of analysis and mechanics at the Ecole Polytechnique in Paris, to rediscover 
the condition for the convergence of a series and to place the calculus on a 
solid foundation. His work on infinit series is contained in Chapter VI of 
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weer 


1821. 


his firs analysis textbook, published in 1821,2° in which he announced his 
intention to endow analysis with “all the rigor that is demanded in geometry” 
[p. ij]. 

Cauchy began by giving the following definitio of limit in the Préli- 
minaires [p. 4; 19]: 


When the values successively attributed to the same variable indefinitel 
approach a fi ed value, so that they finall differ from it by as little as one 
wishes, this latter [value] is called the /imit of all the others. 


This definitio may be disappointing to those who regard Cauchy as the father 
of modern analysis and the initiator of the ¢-6 proof. But this was early in 
Cauchy’s work; his use of € was yet to come, and we shall report on it in 
due course. For the time being, he relied on this notion of limit to state the 
following definitio of the convergence of a series [p. 123; 114]: 


20 Cours d’analyse de l’Ecole Royale Polytechnique. 1" partie, Analyse algébrique, 1821. 
Page references are to this edition and, after a semicolon, to the Guvres. Quotations are 
from the Cours d’analyse. For English translations see the bibliography. 
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Let 


Sp = Ug + Uy + Ug + +++ + Un] 


be the sum of the firs n terms, n denoting any whole number. If, for ever 
increasing values of, the sums, approaches indefinitel a certain limit s, the 
series will be said to be convergent, and the limit in question will be called the 
sum of the series. If, on the contrary, while n increases indefinitel , the sum 
S, does not approach any fi ed limit, the series will be said to be divergent, 
and will have no sum. 


These two definition put together accomplish as much as Fourier’s one 
definition in which s,, is to differ from the limit of the series by “less than any 
given magnitude” on increasing n continually. Again, it is still necessary to 
guess the limit of a series before this definitio can be used. But a couple of 
pages later Cauchy stated that it is necessary that [pp. 125-126; 115-116] 


for increasing values of n, the different sums 


Un T Unt, 


Un + Unqi + Un+2; 


that is, the sums of the quantities 
Un, Unaiy Unea, SCs. <. 


taken, from the first in any number that one wishes, end up by constantly 
obtaining numerical [absolute] values smaller than any assignable limit. Con- 
versely, when these several conditions are satisfied the convergence of the 
series is ensured. 


This is what da Cunha and Bolzano had already stated. Cauchy, who gave 
adefinitio of convergence requiring knowledge of the sum, made this into a 
necessary and sufficien condition that does not require it (but he did not prove 
the sufficien y). Then he used this result, which has become known as the 
Cauchy criterion, to examine the convergence of several series [pp. 126-129; 
116-118]: the geometric series, the harmonic series, and the series for the 
exponential (which he dealt with in the same manner as da Cunha). 

But what is truly remarkable about Cauchy’s contribution is that he did not 
limit himself to stating the definitio of convergence, but immediately started 
proving theorems giving tests for convergence: the root test [p. 132; 121], 
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the ratio test [pp. 134-135; 123],21 the condensation test [p. 135; 123], the 
logarithmic test [pp. 137-138; 125], and the alternating series test [p. 144; 
130].2* Later, in a separate publication and after he had already define the 


21 Edward Waring (1734-1798), Lucasian professor of mathematics at Cambridge, had 
already stated a ratio test in his Meditationes analyticae of 1776. In PROB. III. he made the 
following obscure statement [pp. 298—299; PROB. II., pp. 349-350 of the 1785 edition, in 
which there are minor variations and one error]: 


Let z be the distance from an arbitrary term to the firs term of the series, and also 7 
& T’ two successive terms of the given series, & let there be given an equation of the 


: : b b 
relation between T& 7", viz. T=T" x (az’+ Sar = +&c.) where az’ + ak = +&c. 
gg Sg £ 
is a series following the descending dimensions [powers] of the quantity z, then if 
i scat! elle oer 


does not have a ratio [that is] positive & smaller than the ratio of equality by a finit 
[quantity], then the sum of the preceding series will be an infinit quantity, but if the 
opposite is true finite 


c ‘ fa ged 
+ — + &c. where z is assumed to be an infinit quantity, 


Waring was, of course, thinking of a series of positive numbers and by “the ratio of 
equality” he must have meant 1. Then his inequality is 1 : —z+z(7/T") £1, where rr > 0 
is smaller than | by a finit quantity, in which case, he claimed, the series has an infinit 
sum. If, however, 1 : —z + z(7/T’) <r < 1 then the series has a finit sum. If we think 
that z means our n, if we write u, instead of Tand u,,+; instead of 7’, and if we defin 
€ = (1/r) — 1, then Waring’s statement may be interpreted as saying that if 

—n+n a >l+e>1 
Unt+1 


[oe] 
for all sufficientl large n, then the series }*u, converges, and otherwise it diverges. 


Waring paid a price for his lack of clarity, sites it was apparently not widely known that 
this test was his when it was essentially rediscovered by Joseph Ludwig Raabe (1801-1859), 
a German mathematician working in Zurich, and it is now known as Raabe’s test. 

In the 1785 edition [p. 350] Waring showed how to use this test to prove the convergence 
of what we now calla p-series for p > 1 (using the binomial theorem to expand (n + 1)?), 
thus showing that this test is more powerful than Cauchy’s root or ratio tests. Then he 
gave a simpler ratio test for convergence [p. 351]: “If at an infinit distance 7: T’ have a 
ratio larger than equality by a finit [quantity], then the series is finit ... ” and then gave 
three examples. Even this test is more general than Cauchy’s in that it does not require the 
quotient uv, /Uy+1 to have a limit asm — oo, which makes it applicable to series such as 


i 1 1 1 1 
l+-+——4+ + 


at273'R 3p pip pt 


22 First fully given by Leibniz as Proposition XLLX of De quadratura arithmetica [f. 33”; 
p. 115; 323; 125]. Later, ina letter to Johann Bernoulli of 25 October 1713, he stated that a 
series converges whose terms “decrease endlessly and are alternately positive and negative.” 
In Gerhardt, Leibnizens Mathematische Schriften, Sec. 1, Il, vol. 2, 1856, p. 923. 
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integral, he gave the integral test.2° 


Next we reproduce the alternating series test and Cauchy’s “proof by ex- 
ample,” which uses the so-called Cauchy criterion [pp. 144-145; 139-131]. 


om THEOREM. Ifin the series (1) [up +u, +u2+:+++unt+&c....] the 
absolute value of the general term uy decreases continually and indefinitely, 
for increasing values of n, if in addition the different terms are alternately 
positive and negative, the series will be convergent. 


Consider, for example, the series 


a oe ee ee ee 
, >? T 3° 4’ T C.. eo pel 


+ &e. 


The sum of terms whose order exceeds n, if taken in number equal to m, will 


ne 1 1 1 1 1 
+ + &e.. . 
n+1 n+2 n+3 n+4 n+m 


1 1 1 1 1 
= KC..... 
n+l (5 =a) (4 =) 


1 1 1 1 1 1 
= t t + &C..., 
n+l n+2 (— =a) (— —z) 


being evidently between 
1 1 1 
and - , 
n+1 n+1 n+2 


will decrease indefinitel for increasing values of n, whatever m may be, 
which suffice to establish the convergence of the proposed series. The same 
reasoning can evidently be applied to all the series of this kind. 


A proof of the convergence of Fourier’s trigonometric series was attempted 
by several mathematicians, including Fourier himself throughout his life. He 
was never able to give a rigorous proof, but one of his sketches ** would be 


23 Theorem 2 in “Sur la convergence des séries,” 1827, p. 226 = Guvres, Ser. I, vol. VI, 
pp. 272-273. 


24 Théorie analytique de la chaleur, Art. 423. 
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useful to the man who finall did. In 1822 a young Prussian, Johann Peter 
Gustav Lejeune Dirichlet (1805-1859), came as a student to Paris, then the 
mathematical center of the world. He attended a scientifi salon hosted by 


J.P. G. L. DIRICHLET IN 1853 
Portrait by Julius Schrader. 
From Dirichlet’s Mathematische Werke of 1889. 


Fourier, and became acquainted with trigonometric series and Fourier’s sketch 
of the convergence proof. It was not until 1829, however, that Dirichlet could 
complete a valid proof.*° After replacing a certain trigonometric identity in 
Fourier’s sketch of proof with one of his own, he was able to give sufficien 

conditions for the convergence of a Fourier series: that the function to be 
expanded have a finit number of jump discontinuities and a finit number of 
maxima and minima. 


25 “Sur la convergence des séries trigonométriques qui servent a représenter une fonction 
arbitraire entre des limites données,” 1829. 
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6.5 THE DIFFERENCE QUOTIENT 


With the matter of the convergence of series already settled and the concept 
of limit on the table, the time was ripe for a clarificatio of Newton’s ultimate 
ratios. This could be done in two ways: by putting the concept of infinitesima 
ona fir ground and then using it correctly, or by properly definin the limit 
of a ratio of vanishing quantities. 

Da Cunha adopted the firs approach in Book XV of the Principios, which 
starts as follows [p. 193; 196]: 


DEFINITIONS. 


I. [fan expression admits more than one value, when another expression 
admits only one, this one is called constant, and that one, variable. 
Il. ... anda variable that can always admit a value smaller than any 
proposed quantity, is called infinitesimal. 


This is all he needed to address the problem of ultimate ratios, although he 
did it in a notation and language that may well appear archaic and confusing 
to us. To soften the blow he explained that he would denote a function of 
a root? x by a Greek capital letter, such as P. Then, x denotes the value 
of I’ at x, with no parentheses enclosing the variable. This takes care of the 
notation, but we keep his confusing language in the translation of the next two 
definitions and then explain below [p. 194; 197]. 


DEFINITIONS. 


II. Having chosen any quantity, homogeneous to a root x, to be called 
the fluxion of this root, and denoted thus dx; we call the fluxion of x, and 
will be denoted thus dY'x, the quantity that would make 


dVx 
dx 


constant, and 
T(ix+dx)—Tx dx 


dx dx 


26 As we saw in the preceding section, da Cunha referred to the independent variable as 
the “root” of a function. 
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infinitesimal or cipher [zero], if dx were infinitesimal, and constant all that 
does not depend on dx. 

V. Every quantity is called fluent of its fluxion, and it is denoted by writing 
this sign [ immediately before the fluxion. 


What IIII means is that dx is just a new variable, expressible in the same 
units as x; that dI’x is the product ofa function of x times dx (and thus the firs 
quotient in the definitio is “constant” in the sense that it “does not depend 
on dx”), and that the last expression is, for each fi ed x, infinitesima as a 
function of dx. 

To show that he knew what he was talking about, he immediately stated 
and proved the following propositions [pp. 194—195; 197-198]: 


PROPOSITIONS. 


I. x infinitesimal will make Ax + Bx* + Cx? + Dx4 + &c. infinitesimal, 
if the coefficients A, B, C, D, &c. were constants. 


Let n be the number of coefficient A, B, C, D, &c., and P any quantity 
larger than each of them:2” let QO be any proposed quantity: take x < Q/nP, 
and < 1: it will be 


1 1 > 1 P 
-Q>Px, -Q>Px*, -Q>Px, 
n n n 
and so on; and then 
1 
nx -Q, 
n 
that is 


QO > Ax + Bx? + Cx> + Dx*+ &c. 


This may be the firs ¢-5 argument in history. That is, what we now call an 
€-6 proof. Just write ¢ instead of Q and choose 6 = ¢/nP. Da Cunha showed 
that if x < min{6, 1} then Ax + Bx? + Cx? + Dx*+ &c. < €. This is modern 
analysis and shows that his definitio of infinitesima works. He used this 
proposition and the binomial theorem to prove the next, findin the fluxio of 
ee cae 


I. d(x”) = nx"! dx. 


27 Da Cunha continued to assume that all his numbers are positive, but the modern reader 
can easily generalize his argument. 
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For dx infinitesimal and what does not depend on dx, constant, make 


nx"! dx 
5 [= nx""} 
x 


constant and 
(x-+dx)"—x" nx"! dx 


OST ets ~ 2 3 ay? 4 & 
= n— + A x x x Cs 
n 5 x x 5 3 

infinitesimal 


In a similar manner, da Cunha then proved the following propositions 
[pp. 195-196; 198-200]: 


Wl. d(x +x) =dx+dIx. 
Wl. d(a+ bx) = bdx. 
V. d(a+x)") +++ =n(at+x)""! dx. 


VI. dx infinitesimal and what does not depend on dx constant, make 
T(x + dx) — Ix infinitesimal 


VIL. d(xTx) = dxlx+xdPx. 


VIII. Let x represent any [positive] number, and let / indicate hyperbolic 
logarithms: it will be dx = x dix. 


Using the fact that x = e’* and the series for the exponential, da Cunha 
proved this proposition as follows: 


For it is 


1 1 1 1 
= tix + =(Ix) + s(x) + —(Ix)* 4 Ix) + &e. 
dx a(t Ix 5 x) rac x) 7a‘ x) 120‘ x) + &c 


2 4 5 
= dix + 5 Uxpdlx + = (lx)?dlx + 5g layradlx + T00 lx) ax + &e. 


1 1 1 
= (: Ix 4 5 (xy 5 lx xq C2)" &e.) dix 


= xdlx. 


28 Tn the original, the last exponent is n — 2, but this is an obvious typo that was corrected 
in the French translation. 
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After stating two corollaries, he turned his attention to higher-order dif- 
ferentials, stated his version of Taylor’s theorem, and proved some equalities 
for what we now call mixed partial derivatives [pp. 197-200; 200—203]. With 
one exception, the rest of Book XV is about arcs, areas, and volumes. The 
one exception is on indefinit integration [p. 202; 205]: 


XVI. Between two fluent of equal fluxion there can be no difference 
that depends on the root [independent variable]. 


Let dP'x = dAx, and if it is possible, let lx — Ax = Ox: it will be 
d@x = dl'x — dAx = 0: which shows that ©x cannot be a function of the 
root. 


To sum up, da Cunha gave us the so-called difference quotient 


T(ix+dx)—Tx 
dx 


for the firs time in print, and built on it what can be viewed as a quasi- 
rigorous but incomplete differential theory. However, it is still quite different 
from today’s presentations in calculus textbooks. 

The difference quotient was independently introduced as the fundamental 
notion of the differential calculus by Simon Antoine Jean L’Huilier (1750— 
1840), of Geneva. In 1786, his essay Exposition élémentaire des principes 
des calculs supérieurs won the prize offered by the Berlin Academy for “a 
clear and precise theory of what is called Infinity in mathematics.” 29 In §XIV 
[p. 31] of this text, L’ Huilier expressed himself as follows: 


To abbreviate & to facilitate the calculus with a more convenient notation, 


AP 
it has been agreed to otherwise denote lim. ae the limit of the quotient of 
x 
P 
the simultaneous changes of P and x, namely by ore so that lim. ei and 
x x 


dP ; 
—— denote the same thing. 
dx 


Adding a few lines below [p. 32]: 


. one must not think of this symbol as composed of two terms dP, & 
dx; but as a unique & unbreakable expression giving the limit quotient of 
AP over Ax. 


29 Page references are to the Berlin edition of 1787. 
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While these statements are valuable and the notation “lim” is used to this 
day, the fact is that l’Huilier’s work contains very little of real value beyond 
some elementary examples. It is also based on an antiquated definitio of 
limit, with which he began §I of Chapter 1 [p. 7]: 


1“ Definition. Let a variable quantity, always smaller or always larger than 
a proposed constant quantity; but that can differ from the latter less than any 
given quantity no matter how small: this constant quantity is called the limit 
in largeness or in smallness of the variable quantity. 


This is a regression to the d’Alembertian idea that a variable quantity cannot 
“surpass the magnitude that it approaches.” °° 

Eventually, 1’Huilier would reach some wrong conclusions, such as the 
following [§LXVIIL, p. 167]: 


If a variable quantity, susceptible of a limit, enjoys constantly a certain 
property, its limit enjoys the same property. 


Not so. We know, for example, about an irrational number as the limit of a 
sequence of rational numbers. 

Another contestant for the 1786 Berlin prize was the Frenchman Lazare 
Nicolas Marguérite Carnot (1753-1823). Since his 1785 essay Dissertation 
sur la théorie de l’infini mathématique did not win the prize, it was not pub- 
lished until much later. A revised and expanded version was published in 
Carnot’s own time, but it did not really advance the calculus as we know it. 

As president of the Berlin Academy, Joseph-Louis Lagrange (1736-1813), 
born in Turin as Giuseppe Lodovico LaGrangia, knew that the entries in the 
1786 competition were weak, and decided to revise an earlier attempt of his 
own to make the calculus rigorous in an algebraic manner.*' This revision 
was published in 1797 after he moved to France, already a member of the 
Academy of Sciences in Paris, with a title that clearly expressed his intention: 
Theory of analytic functions, containing the principles of the differential cal- 
culus, free of any consideration of infinitesimals or vanishing [quantities], of 
limits or fluxions, and reduced to the algebraic analysis of finite quantities.>2 


30 |Huilier would relax this requirement in the 1795 edition [pp. 17-18], but only for 
alternating series. His general definitio [p. 1] was one-sided. 

31Sur une nouvelle espéce de calcul relatif a la différentiation et a l’intégration des 
quantités variables,”1772. 

32 Théorie des fonctions analytiques, 1797, 1813. Quotations are from the second edition 
of 1813, except for enclosing the variable x in parentheses. Page references are to this 
edition and to its reproduction in Zuvres de Lagrange, vol. 9, 1881. 
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For obvious reasons, Lagrange did not like infinitesimals and he thought that 
fluxions define in terms of motion in time, were not admissible in mathe- 
matics. He also refused to accept Euler’s zero over zero quotients, but found 
something very promising in the use that Euler had made of infinit series in 
the Introductio. 

Lagrange based his study on his belief that a function can always be ex- 
panded in a power series in the following manner [pp. 2; 15—16]: 


When the variable of a function is given any increment whatever, by adding 
to this variable an undetermined quantity, we can by the ordinary rules of 
algebra, if the function is algebraic, expand it according to the powers of 
this undetermined [quantity]. The firs term in the expansion will be the 
proposed function, which will be called the primitive function; the following 
terms will be formed from functions of the same variable, multiplied by the 
successive powers of the undetermined [quantity]. These new functions will 
depend only on the primitive function from which they are derived, and can 
be called derived functions. In general, whatever the primitive function may 
be, algebraic or not, it can always be expanded or supposed to be expanded 
in the same manner, and thus originate the derived functions. 


This statement is reproduced in full because it contains terminology that 
is in use to this day, although Lagrange’s derived functions are now called 
derivatives.°° However, it is best to give the expansion in the following no- 
tation. If f represents the primitive function and i is the increment of the 
variable x, Lagrange expanded f(x + i) [pp. 8; 21-22] 


in a series of this form 


f(x) + pitgqi? +ri? + ete., 


in which the quantities p, qg, r, etc., coefficient of the powers of 7, will be 
new functions of x, derived from the primitive function of x, and independent 
of the undetermined i [for indéterminée]. 


In this formulation, it is possible to consider the difference quotient and 
show that there is no problem in putting i = 0 [p. 24]: 


33 As for these terms, John Collins’ use of the words “derived” and “Derivative Equation” 
in the present technical sense, in a letter to James Gregory of 8 November 1672, p. 244, is 
perhaps the earliest in the English language. 
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Thus we shall have 
fe +i) = f(x) +iP 
and therefore f(x +i) — f(x) =iP, and consequently divisible by 7; once 
the division is performed, we shall have 
_ f@+)-f@), 


1 


P 


Here P = p+qi+ri*+---, so that P = p fori = 0. Thus, there is no 
need for limits, and no differentials or fluxion have been used. Eventually, 
Lagrange introduced a new notation for the derived functions [p. 32]: 


ot eee () -_ 34 
2 2-3 


so that his general expansion is a Taylor series, and carried on with his program 
of making calculus rigorous. 

Unfortunately, there is a hole in the bucket, and it is precisely his basic 
belief that every function can be expanded in the stated manner. We know 
that this is not the case, as was plainly demonstrated by Cauchy in a paper of 
1822.°° Here he showed that if 


p= 7G), 


fase" 


(completed with the value f(0) = 0), then f’(0) = 0, f’(0) = 0,... , so that 
this nontrivial function cannot equal its Maclaurin series [pp. 49-50; 277— 
278]. The same is true of 


f(Qx)= eli sin’ x and f= el atba texte) 


where a is a positive constant and a + bx +. cx?+ --- anentire function of x. 
As a consequence, the functions 


en and e* + ex 
have the same Maclaurin series, but its sum can equal only one of them [p. 278]. 
Thus Lagrange’s approach is also unsatisfactory, and the answer to the 
problem of providing rigor to the calculus must lie elsewhere. 


34 Tn the original 1797 edition Lagrange did not use parentheses about the variable x 
[p. 14]. 

35«Sur le développement des fonctions en séries et sur l’intégration des équations 
différentielles ou aux differences partielles,” 1822 = Guvres, Ser. II, Il, Page references are 
to the original paper firs and then to the Zuvres. 
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6.6 THE DERIVATIVE 


Cauchy took a different path and established his calculus on the concept of 
limit, adopting |’Huilier’s notation for it. He presented the differential part 
in the firs twenty lessons of his second book: Résumé des lecons donnés a 
l’Ecole Royale Polytechnique sur le calcul infinitésimal of 1823. 

As da Cunha had done before him, Cauchy began by definin variable 
and constant, and then limit [p. 1; Te using word for word, in each case, 
the definition previously given in the Cours d’Analyse. Then he define 
infinitesima as a variable whose numeric values [p. 4; 16] 


decrease indefinitel in such a manner as to become smaller than any given 
number. 


This is a less precise version of what da Cunha had said, but Cauchy added: 

“A variable of this kind has zero for a limit.” As examples, Cauchy evaluated 

the limits of ; 
sin a 


and ad Ae 


a 


as a — 0, which he would need later.°” 

In the second lesson he introduced the terms independent variable and 
function [p. 5; 17], and later define continuous function much as Bolzano 
had already done, all of these with the meanings in use today. And then, in 
the third lesson [p. 9; 22], we fin a breakthrough. Referring to a continuous 
function y = f(x) and denoting an increment in x values by Ax = i, he 
considered the difference quotient (his italics) 


Ay — f@+i)— f@) 


Ax i 


in the case in which i is infinitesimal so that both numerator and denomina- 
tor are infinitesimal °° Here is the complete quotation containing Cauchy’s 
discovery of the derivative, in which he adopted some of Lagrange’s notation 
and terminology [pp. 9; 22—23]: 


36 Page references are to the Résumé firs and, after a semicolon, to Guvres, Ser. Il, IV. 
Quotations are from the Résumé. 

37 He had already found the latter in the Cours d’Analyse [pp. 166-167; 147-148]. 

38 Cauchy had already considered this difference quotient in his Cours d’analyse and 
evaluated its limit in some examples [pp. 62—64; 65—67]. 
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But, while these two terms indefinitel and simultaneously approach the limit 
zero, the quotient itself could converge to another limit, either positive, or 
negative. This limit, when it exists, has a specifi value, for each particular 
value of x; but it varies with x. Thus, for example, if we take f(x) = x”, 
m denoting a whole number, the quotient of the infinitel small differences 
will be 


x +i)" — x" m(m— 1 
( + y = mx"! ( ) m2; 


i 1-2 


and it will have the quantity mx’"~! as a limit, that is, a new function of the 
variable x. It will be the same in general; only, the form of the new function 


f@ +i) — f@) 


that will serve as the limit of the quotient will depend on 


i 
the form of the proposed function y = f(x). To indicate this dependence, 
the new function is given the name derived function, and it is denoted, with 
the aid of an accent, by the notation 


y’ or f'(x). 


In typical fashion, Cauchy then proceeded to compute the derivatives (de- 
rived functions) of “the simple functions that produce the operations of algebra 
and trigonometry” [p. 10; 23], that is, a+ x,a—x,ax,a/x, x", A*, Lx (the 
logarithm to base A in this context), sinx, cos x, arcsinx, and arccos x. To 
deal with some of these he used the limits found in lesson one, but for the 
last two he needed a new result on composite functions, and he produced it as 
follows [p. 11; 25]: 


Now let z be a second function of x, related to the firs y = f(x) by the 
formula 

z=F(). 
z or F(fx) will be what one calls a function of a function of the variable x; 
and, if we denote by Ax, Ay, Az, the infinitel small and simultaneous 
increments of the three variables x, y, z, we shall fin 


Az Fa+Ay)—FQ) _ FQx+Ay)— F(x) Ay 
Ax Ax i Ay Ax 


and then, passing to the limit, 
2! = yl F'(y) = f').F' (fx). 
In this way the chain rule appeared for the firs time in print, and then Cauchy 


used it to compute the remaining derivatives of simple functions. For example 
[pp. 12; 25-26], 
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for y = arcsin x, 


1 1 
siny=x, ‘cosy = 1, f= 
y y cosy Es i 
for y = arccos x, 
cos "x —sin 1 : ae a 
=X, _ = A = ; = . 
= 2S siny VT? 


After this, Cauchy took flight and we can only feel like the cameraman at 
Kitty Hawk watching Orville Wright getting farther and farther away. In the 
fourth lesson he define differential and differentiation, in the fift he differen- 
tiated sums, differences, products, and quotients and introduced logarithmic 
differentiation, in the sixth he dealt with maxima and minima, the seventh 
contains a treatment of undetermined limits (co /oo and the like) and the mean 
value theorem,*? the eighth deals with partial derivatives, in the ninth he gave 
us the chain rule for functions of several variables and differentiated implicit 
functions, in the tenth he explored maxima and minima of functions of several 
variables, in the twelfth he told us about higher-order derivatives for functions 
of one variable and in the thirteenth for functions of several variables, the 
fourteenth dealt with total differentials, and in the fifteent and the sixteenth 
he returned to maxima and minima. He finishe at the twentieth with partial 
fraction decomposition. Thus, almost the entire differential calculus sequence 
was covered in the firs 121 pages of this book. 

Cauchy used the €-é notation and this type of proof, but not quite as often or 
as precisely as we would like to think. The following example from Lesson 7 
is probably his clearest use of this method [pp. 27—28; 44—45]: 


Theorem. Jf, the function f (x) being continuous between the limits x = xo, 
x = X, we denote by A the smallest, and by B the largest of the values that the 
derived function f'(x) attains in this interval, the ratio of the finite differences 
f(X) — fo) 
(4) ee 
— x9 


will be necessarily between A and B. 


39 First obtained by Lagrange in the form “Ifa prime function of z, such that f’z is always 
positive for all the values of z, from z = a to z = b, b being > a, the difference of primitive 
functions corresponding to these two values of z, namely, fb — fa, will necessarily be a 
positive quantity,’ Théorie des fonctions analytiques, 1797, Art. 48, p. 45 = 1813, Art. 38, 
p. 63 = Gwvres de Lagrange, vol. 9, Art. 38, p. 78. 
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Demonstration. Let 5, €, denote two very small numbers, the firs chosen in 
such manner that, for numeric [absolute] values of i smaller than 6, and for 
any value of x between the bounds xo, X, the ratio 


f(x +1) — Ff) 


1 


will always remain larger than f’(x) — a and smaller than f’(x)+e. If we 
insert nm — 1 new values of the variable x between the bounds xo, X, namely, 


X1, X2, «++, Xn-1, 
so as to divide the difference X — xo into elements 
Hy 0s 2 Sig Sua Ph MS Ret; 


which, all of them having the same sign, have numeric [absolute] values 
smaller than 5; then the fractions 


Fa) = FG) f@2) — fe) -_ F(O) = $@n-) 


X1 — Xo x2 —X] X — Xn-1 


(5) 


are bound, the firs between the limits f’(xo) —€, f’ (xo) +, the second 
between the limits f’(11) —¢, f’(x1) +6, &c. ... will all be larger than 
the quantity A — «, and smaller than the quantity B + «. Moreover, 
since the fractions (5) have denominators of the same sign, if we divide 
the sum of their numerators by the sum of their denominators, we 
shall obtain a mean fraction, that is, bound between the smallest and 
the largest of those considered [see |’ Analyse algébrique, note II, 12.° 
theorem]. The expression (4), with which this mean coincides, will 
thus itself be bound between the limits A — €, B + €; and, since this 
conclusion holds, no matter how small the number ¢, it can be asserted 
that the expression (4) will be between A and B. 


In 1829, with the Résumé out of print, Cauchy published a modifie edition 
of the differential calculus separately.*" 


40 There is an obvious typo in the Guvres, where the last expression appears as f(x) —€. 
a Lecons sur le calcul différentiel, 1829. 
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6.7 CAUCHY’S INTEGRAL CALCULUS 


The second half of the Résumé des lecons donnés a l’Ecole Royale Polytech- 
nique sur le calcul infinitésimal is devoted to the integral calculus, which 
Cauchy developed as extensively as the differential calculus. 

Mathematicians of the early nineteenth century had abandoned the Leib- 
nizian concept of integral as a sum, perhaps having some difficult accepting it 
as the sum of infinitel many infinitel small quantities, in favor of the integral 
as a primitive or prederivative. Cauchy reversed that approach in the twenty- 
firs lesson of the Résumé. He considered a continuous function y = f(x) and 
n — | increasing (or decreasing) values of the variable x between the bounds 
xo, X, namely [p. 81; 122] 


X], X2, +++, Xn—-1- 
Then he formed the sum 


S = (x1 — Xo) f(X0) + (2 — x1) f 1) 4 +X = Mn) f On-1)- 


Assuming that f was what we now call uniformly continuous, Cauchy proved 
that [p. 83; 125] 


if we make the numeric values of these elements [the lengths x; — x;_] de- 
crease indefinitel , while increasing their number, the value of S will become 
practically [sensiblement] constant, or, in other words, will ultimately attain 
a certain limit that depends uniquely on the form of the function f(x), and 
on the extreme values xo, X of the variable x. This limit is what we call a 
definite integral. 


For this definit integral he proposed three notations [p. 84; 126], the firs of 


which, 
x 
[seas 
x0 


for which he credited Fourier, he considered the simplest. 

In the next two lessons Cauchy proved some of the elementary theorems for 
definit integrals that are familiar to us from calculus textbooks and evaluated 
some examples. In the twenty-fourth and twenty-fift lessons he discussed 
improper integrals and define the principal value. Then, in the twenty-sixth 
lesson, he define a new function [p. 101; 151] 


F(x) = / fede 
XO 
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of the variable upper limit of integration, but without thinking of using what 
we call a dummy variable in the integrand. From the mean value theorem for 
definit integrals, which he had established in the twenty-third lesson, Cauchy 
obtained 


F(x) = (x — x0) f[xo + O(& — x0)], F(xo) = 0,7 


“@ being a number smaller than unity” (and not smaller than zero). From one 
of the elementary theorems previously proved, 


Xx+a x x+a 
i f(x) dx — i f(x)dx = / f(x)dx =af(x + 0a), 
x0 x0 XX 


or 
F(x +a) — F(x) =af(x + Oa). 


Thus, if f is continuous, dividing by a [p. 101; 152], 
we shall conclude, passing to the limit, that 
F(x) = f(x). 


Therefore the [stated] integral, considered as a function of x, has as its deriva- 
tive the function f(x) under the sign f in this integral. 


This is Cauchy’s version of the fundamental theorem of calculus. Later in the 
same lesson he gave the formula [p. 104; 155] 


/ ~ f(x) dx = F(x) — F(xo), 
x0 


where F is a function such that F’(x) = f(x). 

In the twenty-seventh lesson Cauchy turned to indefinit integration, promi- 
nently displaying what he termed the arbitrary constant [p. 105; 157] of inte- 
gration, and gave a method that he called integration by substitution as follows 
[pp. 107; 159-160]: if y = f(x) and if instead of x we substitute another vari- 
able z related to the firs by x = x(z), then 


7 fides i (2) dz, 


where f(z) = f[x()]x’@). 
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Then integration by parts was given in the form that has become standard 


[p. 108; 162]: 
fudvo=u— fvdu 


It is not necessary to fully describe the contents of the rest of this work. In 
general terms, it can be said that, with the notable exception of vector analysis 
(gradient, divergence, curl), the entire calculus sequence as taught today is 
contained in the Résumé. The converse is not necessarily true, as shown by 
formulas such as [p. 119; 179] 


/ [I(x + Vx? + 1)]" dx 
nV x2 +1 n(n — 1)x 


Pete TO Gy FeeD * fies Pp 


n(n — 1)(n —2)V x2 +1 e | +0 
[e+ V7 +0) | 


6.8 UNIFORM CONVERGENCE 


It may have come as an unwelcome surprise to Cauchy, once the conver- 
gence of series was established on a fir ground, that the Norwegian mathe- 
matician Niels Henrik Abel (1802-1829) noticed an incorrect theorem in the 
Cours d’analyse: the sum of a convergent series of continuous functions is 
continuous.** Abel wrote:49 


But it seems to me that this theorem has exceptions. For example the [sum 
of the] series 
sin. x — 5 sin 2x + £ sin 3x eee 


is discontinuous for every value (2m + 1)z of x, where m is an integer. 


Indeed, this is Fourier’s series expansion of y = x/2 valid for —z7 <x <az 
(see the example on page 379), and its convergence is guaranteed by Dirichlet’s 


42 Cours d’analyse de l'Ecole Royale Polytechnique, Chapter VI, 1.5' Tuzorem, [pp. 131- 
132; 120]. 

a “Untersuchungen tiber die Reihe 1 + (m/1).x + (m- (m—1)/1-2).x? + (m- (m— 
1)-(m—2)/1-2-3).x3 +--+ u.s.w,” 1826. French translation in Euvres completes. The 
stated quotation is in the footnote on page 225. 
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Portrait by Johan Gorbitz. 


1829 theorem. But Cauchy did not take Abel’s objection to heart at that time, 
for he restated the incorrect theorem in his Résumés analytiques, published in 
Turin in 1833.4 

We shall then seek additional conditions for the sum of a convergent series 
of continuous (alternatively, differentiable or integrable) functions 


Uy bug bees bun tee 


to be continuous (or, respectively, differentiable or integrable). Now let s 
denote the sum of the series and let s, denote the sum of the firs n terms. 
Using Cauchy’s letter ¢, to be fashionable, the definitio of convergence of 
the stated series can be rephrased as follows. Imitating Cauchy’s style in 
proving the theorem of Section 6.6, the convergence of the series 


Uy (x) + ug(x) +++ + Une) +> 


447 th Theorem, p. 46 = Guvres, Ser. Il, X, 1895, p. 56. 
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to a sum s(x) for a given x can be expressed as follows. Given any small 
positive number e¢, there is a positive integer N such that for n > N the partial 
sum 

Sn(X) = u(x) + ug (x) +++ + Un (X) 


will always remain larger than s(x) — € and smaller than s(x) + €. Since 
this definitio does not guarantee the validity of Cauchy’s theorem on the 
continuity of s, a natural question is whether it can be modifie so as to render 
Cauchy’s theorem valid. 

The fact that a convergent series of continuous functions can have a discon- 
tinuous sum was examined, about 1848, by Phillip Ludwig von Seidel (1821— 
1896) 45 and, independently, by Sir George Gabriel Stokes (1819—1903).*° 
They noticed that, for a given €, no N can be chosen so that the inequalities 
in the preceding definitio of convergence are satisfie for all x near a point 
of discontinuity of s. That is, while for each such x there is an N such that 
the stated inequalities hold, the same N does not work for all x. We could 
then conjecture that if we replace the definitio of convergence with one that 
requires N to be valid for all x, then this more demanding kind of convergence 
should make it impossible for the sum of the series to be discontinuous. 

It turns out that in 1838, Christof Gudermann (1798-1852) had already 
introduced a kind of convergence at the same rate (einem im Ganzen gleichen 
Grad der Convergenz), which is the precursor of the one that we seek,*” 
but the importance of the concept was realized by Karl Theodor Wilhelm 
Weierstrass (1815-1897), who would later become one of the main forces 
in making analysis rigorous. Weierstrass was a student at the University of 
Bonn, who in 1839 transferred to Miinster to attend Gudermann’s lectures (it 
was then the custom in Germany for students to switch universities at some 
time). It is quite likely that Gudermann and Weierstrass discussed the new 
concept of convergence. Weierstrass never earned his doctorate, and became 
a Gymnasium (high school) teacher in 1841. Such was the amount of first 
rate research that he produced from this moment on that it eventually earned 
him a position at the University of Berlin in 1856. This research remained 
unpublished for a long time, but the fact that he referred to a power series 
that converges uniformly—gleichmdssig convergirt—in an 1841 manuscript, 
Zur Theorie der Potenzreihen, supports the idea that he may have learned 
about it from Gudermann. This manuscript contains a definitio of uniform 


45 «Note iiber eine Eigenschaft der Reihen, welche discontinuirliche Functionen dar- 
stellen,’ 1847-49. 

46 “On the critical values of the sums of periodic series,” 1848. 

47 “Theorie der Modular-Functionen und der Modular-Integrale,” 1838, pp. 251-252. 
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KarL WEIERSTRASS 
From Weierstrass’ Mathematische Werke of 1894. 


convergence for a series of functions of several variables, which Weierstrass 
restated in a more polished form in “Zur Functionenlehre,” a paper of 1880.48 


An infinit series 
CO 
dios 
v=n 


whose members are functions of arbitrarily many variables, converges uni- 
formly in a given part (B) of its region of convergence, if after the assumption 
of an arbitrarily small positive quantity 5 a whole number m can always be 
determined, such that the absolute value of the sum 


lee) 


Le fy 


v=n 


48 With very minor modifications this translation is from Grattan-Guinness, “The emer- 
gence of mathematical analysis and its foundational progress, 1780-1880,’ in Grattan- 
Guinness, From the calculus to set theory, 1630-1910. An introductory history, 1980, 2000, 
p. 134. 
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is smaller than 6 for each value of n, which is > m, and for each collection 
of values of the variable belonging to the region B. 


Cauchy was unaware of these facts and, quite independently, when he 
finall saw the light on this point, introduced the concept of uniform conver- 
gence (but not this name) in 1853.49 Then he proved the corrected version of 
his earlier theorem that motivated the preceding discussion. In current termi- 
nology, the sum of a uniformly convergent series of continuous functions is 
continuous.°° 

As in Cauchy’s case regarding ordinary convergence, the importance of 
Weierstrass’ contribution stems from the fact that he realized the usefulness 
of uniform convergence and incorporated it in theorems on the integrability 
and differentiability of series of functions term by term. 


49 “Note sur les séries convergentes dont les divers terms sont des fonctions continues 
d’une variable réelle ou imaginaire entre des limites données,” 1853 = Guvres, Ser. I, XI, 
1900. 

50 Cauchy’s own statement of this result (Theorem IJ) is not very clear. He required the 
sum Uy, + Un+1 +-+++U,/_1 to become “always infinitel small for infinitel large values 
of the integers n and n’ > n” [pp. 456—457; 34-35], but did not mention any values of x 
for which this should be true. Did he rely on the word “always” to mean “for all x in the 
domain of the stated functions”? 
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De lateribvs et angvlis triangulorum 63, 421 

de Moivre, Abraham 173-178 

de Moivre’s formula 178, 179, 188 

De quadratura (Leibniz) 340, 341, 347-349, 
391, 433, 434 

De quadratura (Newton) 227, 295, 312, 313, 
315, 317, 360, 439 

De revolvtionibvs orbium celestium 63, 421 

de Sarasa, Alfonso Antonio 119, 120, 122, 
418, 445 


see Alembert, 
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De triangvlis omnimodis 57,58, 62, 444 
decimal 
—notation 62, 84-88, 96, 105, 106, 109, 


111, 128, 167, 184 
—point 84, 85,105, 106, 109, 111, 121 
degree (arc) 3,4, 8, 10, 11, 18, 20, 21, 24, 
28, 34, 38, 99, 177, 186, 426, 441, see 
also minute (arc), Ptolemaic unit, second 
(arc), zodiac 
circle divided into 360 —s 3,8, 28 
Delambre, Jean-Baptiste Joseph 33, 38, 422 
Delarame, Francesco 79 
Denmark 75, 80, 129, 186 
depressed cubic 148, 150, 153, 159, 160, 171 
derivative 143, 144, 158, 178, 223-225, 228, 
229, 234, 240, 262, 266, 275, 278, 280, 
295, 311, 328, 332, 361, 370, 375, 376, 
397, 399, 401-403, 406, 417 
Descartes, René 15, 83, 173, 234, 239, 246, 
247, 260, 278, 318, 336, 354, 414, 422, 


425 

Descriptio 81, 82, 86, 89, 90, 100, 438 

d’Espagnet, Etienne see Espagnet, Etienne 
d 

diameter see Ptolemaic unit, trigonometric 
lengths 


—ofthe firs shadow 53 
— ofthe second shadow 53 
—ofthe shadow 39,54 
difference 30-33, 110-116 
backward 111,113,114 
mean 116 
— quotient 237,394, 397, 399, 401 
table of—s 111-114,217 
differentiability 411 
differential 
—calculus see calculus, differential 
—equation 89,90, 334, 352, 354, 355, 
357, 359, 365, 370, 371 
ordinary 377 
partial 369,377 
differentiation see also integration 
implicit see implicit differentiation 
logarithmic see logarithmic differentiation 
—ofpower 311,332,335, 340, 352 
—ofrational power 263, 264, 275 
tules of 289, 332, 335, 347, 352, 355 
difform motion or quality 231-233, 420, 441 
Diophantos of Alexandria 236, 243, 422 


Dirichlet, Johann Peter Gustav Lejeune 393, 
408, 422 

dirimendo 22 

Dositheos of Pelusium 199 

double 
— angle formula 
—arc formula 38 

Dublin 191 

Duns Scotus, John 230 


68, 99 
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e 136, 139, 332, 384 
— as base of logarithm 141 
—inEuler’s formula 183 
value of 139 
earth 3,5 
distance of sun and moon from 6 
eclipse 5 
Edict of Nantes 173 
Edinburgh 78, 103, 221, 361 
University of 105,214 
Edward VI of England 79 
Edwards, Charles Henry 130, 234, 423 
Egypt 8, 34, 197, 374, 375, see also Rhind 
mathematical papyrus 
Elements see Euclid’s Elements 
Elert, Glenn 12 
England 101, 129, 173, 234, 359 
epsilon-delta proof see €-d (symbol) 
equidifferent logarithms 92,96 
equinox 1,7 
Espagnet, Etienne d’ 425 
Eucleides of Alexandria see Euclid 
Euclid 8, 12, 22, 197, 198, 202, 241, 423 
—ean geometry 4, 14, 15,21, 194, 203, 
241, 275, 279, 288, 291, 361, 437, 441, 
442, see also geometry 
—’s Elements 9, 12,21, 22, 36, 37, 41, 56, 
158, 197, 241, 423 
Eudoxos of Cnidos 200 
Euler, Leonhard 77, 136-146, 172, 176-185, 
191, 195, 206-212, 220, 332, 365-367, 
371-373, 378, 380, 384, 385, 399, 423 
Euler’s 


— formula 182-184 
—number see e 
Europe 34, 55-57, 61, 65, 80, 84, 101, 117, 


149, 150, 181, 201, 213 
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Exercitationes geometrice 
216, 217, 227, 429 
exhaustion 
method of 200, 230 
exponential function 137, 139, 140, 176, 
181-183, 357, 385, 390, 396 
general 384 
series expansion of 
183, 390, 396 


130, 142, 195, 


139, 140, 176, 182, 
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Faber, John 174 

factorial 222 

fecundus 74, see also tabula fecunda, 
trigonometric lengths 

— residue 67 

Fagnano, Giulio Carlo de’Toschi di 184 

Feingold, Mordechai 358, 426 

Fermat, Pierre de 234-257, 260, 274, 282, 
283, 293, 294, 329, 333, 336, 414, 425, 
437 

fift root 

extraction of 106 

Fincke, Thomas 75, 76,426 

finit differences see differences 

Fior, Antonio Maria 150 

firs ratio 306, 308, 316-318, 368, see also 
last ratio, ultimate ratio 

firs shadow 39, 53, 54, see also trigonomet- 
ric lengths 

Flensburg 75 

Florimond de Beaune 354 

fluen quantity 227,295,296, 302, 312, 315, 

316, 335, 395, 397, 439 

142, 220, 221, 227, 228, 291, 292, 

294-296, 298, 299, 302, 312, 315-318, 

333, 335, 360, 361, 394, 395, 397-400, 

412, 431, 433, 437-441 

folium of Descartes see galande, la 

Fontana, Nicolo _ see Tartaglia, Nicolo 

Fourier, Joseph 373-380, 385, 386, 390, 
392, 393, 405, 426, 429 

Fourier series 392, 393, 407, 422 

Fourier’s method for heat equation 
also separation of variables 

fractional exponent 130, 133, 135, 188 

Frangais, Jacques Frédéric 188, 426 

France 56, 189, 234, 373, 374, 386, 398 


fluxio 


377, see 


Franks 35 

Freeman, Samuel 105 

Friis, Frederik Reinhold 61 

function 215 

fundamental 
— theorem of calculus — see calculus, 

fundamental theorem of 

—vibration 369,371,373 

Fuss, Nicolaus 424 


Fuss, Paul Heinrich 184, 424, 426 


G 


Gadeke, Nora 436 

galande,la 246, 247, 282, 283 

Galileo Galilei 234 

gambling 151 

Ganges River 25 

ganitapada 26, 27, 30, 33 

Gardiner, William 138, 426 

Gauss, Carl Wilhelm Friedrich 
194, 386, 427, 432 

Geber _ see Jabir ibn Afla al-Ishbili, Abu 
Mohammed 

Geille, Amédée Félix 374 

Gellibrand, Henry 116, 418, 441 

Gelon 3 

Genghis Khan see Temujin 

geography 1,9, 10, 232, 437, 443 

Geometrie pars vniversalis 122, 226, 
258-267, 269-273, 428, 429, 443 


190, 191, 


geometric 
—homogeneity 264 
—methods 5, 6, 93, 98, 119, 150, 156, 


159, 160, 169, 182, 258, 264, 271, 275, 
277, 280-282, 288, 294, 302, 310, 324, 
335, 358 

— motion 82, 89,91, 93, see also 
arithmetic motion 

— progression 83, 84, 101, 118-120, 127, 
195-198, 248-254, 355, 418, 425, see 
also continued proportion 

—ratio 366 

— representation of complex number 
185-188, 190, 191, 194, 413, 445, 449 

— representation of intension 232 

—series 201,202, 206, 390 
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192, 197, 205, 225, 234, 275, 315, 318, 

322, 369, 389, 439, 449, 450 half 
Alexandrian 239 —angle formula 19 
re 361 —arc formula 18, 23, 38 
panne ie . half-chord 26, 28, see also trigonometric 
Euclidean see Euclidean geometry lengths 
non-Euclidean 194 table of —s 29,37 


professor of 
Savilian 
solid 233 
Gerhardt, Carl Immanuel 
326, 328, 330, 331, 334, 391, 415, 420, 
427, 433-436, 446 
Germany 75, 129, 330, 340, 409 
Gherardo de Cremona 56, 75, 443 
Girard, Albert 172, 428 
gitikapada 30 


115, 131, 449 


Glaisher, James Whitbread Lee 86, 428 
gnomon 1, 7,37, 443 

Golapada 30 

Goldbach, Christian 184 

Goldstine, Herman Heine 116, 428 


Gonzalez-Velasco, Enrique Alberto 428 

Gérbitz, Johan 408 

Gowing, Ronald 182, 421, 428 

Grattan-Guinness, Ivor 375, 376, 410, 426, 
429, 443 

great circle 4 

Greek see also Hellenic 

—astronomy and mathematics 1, 2, 7, 8, 

10, 13, 26, 37, 156, 422, 430, 443, 447 

Gregory, James 120-122, 130, 141-143, 
195, 214—227, 234, 258-272, 274, 275, 
278, 282, 290, 292, 294, 318, 329, 334, 
339, 349, 399, 421, 428, 429, 443, 447 

Gregory—Leibniz formula 349 

Gregory’s interpolation formula 

Grenoble 374 

Gresham College 101, 103, 116, 220, 441 

Gresham, Thomas 101 

Grooth, Johann Niklaus 372 

Grunert, Johann August 356, 450 

Gudermann, Christof 409, 429 

Gundisalvi, Dominicus 56 

Gunter, Edmund = 77, 429 

Gupta empire 25 

Gustavus Adolphus Il of Sweden 117 

Gutas, Dimitri 36, 429 


217-220 


173, 206, 304, 318, 


Hall, Alfred Rupert 289, 358, 416, 421, 429 

Halley, Edmund 142, 359, 429, 438, 442 

Hamborg, Otto 341,434 

Hamilton, William Rowan 190-194, 430 

Handmann, Jakob Emanuel 136 

Handson, Raph 76, 443 

Hanover 318, 334,340 

Hansen, Friedrich Adolf 340 

Hansen, Hans Peter 61 

Hardy, Arthur Sherburne 

harmonic 
—progression 196 


188, 413 


—series 203,204, 206, 390 
—vibration 196, 369,371 
Harran 35 


Harun al-Rashid 36 
Hashashin 43 
Haskins, Charles Homer 56, 430 
Hawking, Stephen 83, 173, 413, 420, 426, 
430, 440 
heat equation 375-378 
Heath, Thomas Little 3, 199, 413, 423, 430 
heliocentric system 2,3 
Hellenic see also Greek 
— civilization 1,8, 26,34, 197 
Henri Wl of France 65 
HenrilV of France 65 
Henry VII of England 79 
Hermann of Carinthia 56 
Heytesbury, William 230, 231, 430 
Highmore, Joseph 114 
Hille, Einar Carl 191, 430 
Hindu _ see also Indian 
— astronomy and mathematics 26-29, 32, 
37, 445 
—numbering 37,56 
Hipparchos of Niceea 1, 7, 8, 10, 447 
Hispalensis, lohannes 56 
Hispania 35 
Hodgson, Marshall Goodwin Simms 43, 431 
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Holl, Benjamin 276, 319 

Holstein 129 

Holtzman, Wilhelm 236, 422 

Hopwood, William 370 

Hospital, Guillaume Francois Antoine de 
l see Hospital, Guillaume Frangois 
Antoine de 

Hoiiel, Guillaume-Jules 188, 413 

Hudde, Johann 204, 234, 304-306, 330, 336, 
431 

Hughes, Barnabas 58, 59, 444 

Hugo de Santalla 56 

Huguenots 173 

Huilier, Simon Antoine Jean |’ 
Simon Antoine Jean 

Hulagu 43,44 

Huygens, Christiaan 

Hveen 80 

hypotenusa 
lengths 

hypotenvsam residue 67, see also 
trigonometric lengths 


see |’Huilier, 


203, 204, 234, 282, 347 


66, 67, 74, see also trigonometric 


i 177,187,191, 194 
—inEuler’s formula 183 
Iberian peninsula 55 
imaginary number 145, 164, 165, 170, 173, 
177, 181-183, 191, see also complex 
number, men di meno, pitt di meno 
implicit 
— differentiation 247, 296, 297, 302, 303, 
305, 307, 334, 335 
— function 282, 304, 403 
India 25, 26, 33, 181, 201 
Indian see also Hindu 
— astronomy and mathematics 
213,413 
Indus river 35 
infinit series see series, infinit 
integrability 344,411 


25, 26, 36, 


integral 
improper 405 
table of —s 302,359 
—test 392 
integration 
—by parts 222, 286, 288, 322, 329, 407 
indefinit 302,397, 406 


interplay between differentiation and 269, 
294, 358, 359 
—ofpower 332 
—ofrational power 274 
interpolation 33,55, 115, 116, 215,219, 220, 
226, 227 
Introductio in analysin infinitorum 77, 137, 
176, 424 
Tran 38,43 
Iraq 37 
Irinjalakuda 212 
Irish Academy 192 
irreducible cubic 160, 161, 163, 168, 171, 
172 
Isére 374 
Islah al-mjsty 55 
Islam 34,35 
Islamic 
— astronomy and mathematics 36, 37, 43, 
55, 84, 419, 422, 445, 450 


—culture 55 
—empire 35,36 
—world 34,37, 43,55, 84 
Istahr 36 
Istanbul 7 


Italy 57,63, 122, 234, 258, 260, 364 
Ivan IV (the Terrible) of Russia 79 
Iznik 7 
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Jabir ibn Af ah al-Ishbili, Abu Mohammed 
55,58 
James VI of Scotland 80 
Jean de Liniéres 57 
Jones, William 138, 181, 208, 214, 291, 292, 
295, 313, 316, 333, 431, 438-440 
Juan de Sevilla 56 
Julius Cesar 7 
Jya_ 26, 30, see jyva 
Jya-ardha 26, 28, 32 
Jyb 37, see also jyba 
translated as sinus 56,57 
jyba 37-42, 45-54, see also jyva, sine, sinus, 
trigonometric lengths 
— of double and half arc 38 
—ofsum or difference 40—42 
—ofthe complement 37-39, 42, 53,54 
table of 37 
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Jyesthadeva 212, 349, 432 Battle of 117 
Jyva_ 26-34, 37, 213, see also jyba, sine, University of 63 
sinus, trigonometric lengths Leén 56 
computation of —s 27,28, 32-34 level shadow 39, see also trigonometric 
reverse-order see utkramajya lengths 
table of —s 26,28, 29 V’HO6pital’s rule 362 
l’Hospital, Guillaume Francois Antoine de 
K 361-364, 415, 431 
V’Huilier, Simon Antoine Jean 397, 398, 401, 
Kanka 36,37 431 
Kashan 55 Liber algebre et almucabola 37,56 
Kepler, Johannes 101, 226, 233, 432 Liége 305 


Kerala 25,201, 212, 213, 349, 443 

Khwarazm 37, 43 

Kitab al-jabr wa’l-muqabala 

Kitab shakl al-qatta 44,412 

Kitty Hawk 403 

Kneller, Sir Godfrey 123 

Knobloch, Eberhard 340, 341, 432, 434, 436, 
446 

Knott, Cargill Gilston 

K6nigsberg 57,191 

Kopernik, Mikotaj see Copernicus, Nicolaus 

Krakow 63 

Kusumapura 25 


37, 56 


78, 82, 105, 432, 438 
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La Roque Jean-Paul de 347, 434 

La Tour, Maurice-Quentin de 370 

Lagrange, Joseph-Louis 378, 398—401, 403, 
432, 433 

last ratio 306, 308, 310, 316-318, 368, see 
also firs ratio, ultimate ratio 

Latham, MarciaL. 83, 173, 422 

latitude of forms 230-232 

Lausanne 137 

law 

—ofsines 44,47—52, 60, 61, 74, 75 
—oftangents 74,75 

Lectiones geometrice 275,277, 278, 282, 
284, 303, 333, 358, 414 

Leibniz, Gottfried Wilhelm 130, 144-146, 
173, 182, 203-206, 234, 312, 315, 
318-362, 364, 391, 405, 414-416, 420, 
426-429, 431-436, 438, 440, 442, 446 

Leibniz formula 349 

Leibniz’ rule 340 

Leipzig 


Lilly, William 101,436 
limit 311, 316, 367, 368, 389 
Lincolnshire 127 
Lisieux 203 
logarithm 81, 82, 86, 90-110, 115-117, 
119-122, 125, 127-130, 133, 135, 136, 
138, 141-147, 177, 178, 182-185, 214, 
219, 220, 320, 322, 331, 355-357, 379, 
385, 386, 402, 418, 423, 426, 430, 433, 
437, 438, 441, 445 
—asexponent 138,385 
base of _ see base of logarithm 
Briggsian 101, 106, 120, 122 
common 143 
computation of —s 81, 90-99, 105, 
108-110, 115-117, 130, 141-143, 429, 
432, 437, 438 
— by method of ffth root 106 
— by method of square root 106, 109, 
110, 141, 143 
—by radix method 86, 428 
derivative of 143, 144, 365 
hyperbolic 117, 120, 122, 128, 130, 135, 
136, 141, 143, 322, 385, 396 
graphof 122 
inf nitude of—s 146, 147, 183-185 
limits (lower and upper bounds) of 90-96 
natural 141, 143, 144, 182, 355, 385 
Neperian 88, 102, 104 
— of complex or imaginary number 145, 
147, 181-185 
— of negative number 
— of product or quotient 
origin of word 90 
series expansion of 139, 176,212 
table of—s 81,92, 96, 106-110, 115, 138, 
438, 441 


144, 145, 147 
102-104, 115 
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logarithmic 
—computation 142 
—curve 355 
— differentiation 178, 185, 403 
— function 137,138, 140 
—relationship 119,355 
—series 130 
—table 82, 96,99 


— test for convergence 391 
Logarithmorum chilias prima 103, 115 
Logarithmotechnia 129, 130, 290, 437, 444, 
449 

London 101, 103, 127, 195, 225, 227, 315, 
330, 333, 358, 428 

Liitzen 117 

Lyceum 
Lyceum 
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Macdonald, William Rae 
Macedonia 8 
Machin, John 220 
MacLaurin, Colin 
437 
Maclaurin series 220-223, 225-227, 229, 
315, 400, 437, 439 


81, 103, 104, 438 


220, 221, 223, 225, 361, 


Madhava of Sangamagrama 212, 214, 349 
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Mahoney, Michael Sean 239, 248, 437 
Mainz 203 

Manduith, John 57 

Maraga 44 


Marcos de Toledo 56 
Marischal College 215 
Marseilles 56 
Marw 35,37 
Mary Tudor Queen of Scots 79 
mathematician 2,26, 37, 43, 65, 75, 80, 120, 
128, 145, 146, 150, 172, 173, 182, 188, 
199, 234, 304, 359, 361, 391, 392, 407 
public contests between —s 150, 152 
mathematics 151,275 
enemies of 146 
professor of 63, 101,355,412 
Lucasian 391,412 
Mathematike syntaxis see Almagest 
Matthias I Corvinus of Hungary 63 
Maximilian II 63 


see Alexandrian Lyceum, Peripatetic 


Mayer, Uwe 318, 436 

mean differences see differences 

mean proportional 54,254, 358 

mean value theorem 403, 406 

Medina (al-Madinah an Nabi) 35 

men dimeno 164, 172, see also imaginary 
number 
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Mengoli, Pietro 206 
Mercator, Nicolaus 129, 130, 290, 348, 437, 


1,7, 8, 10 


444, 449 
Merchiston 81, 101, 102, 106, 438 
seventh laird of see Napier, Archibald 
eighth laird of see Napier, John 
— Campus of Napier University 78 
—Tower 78 
Mersenne, Marin 119, 120, 234, 246, 445 
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Meshed 43 
Methodus 234, 236, 241, 243, 282, 425 
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Metz 188 
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422 

Milan 151 


minute (arc) 10, 11, 13, 27, 28, 30, 31, 38, 
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miqyas 37 
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see Descriptio 
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see Constrvctio 
Mohammed ibn Abdallah 34, 35, 43 
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—as product in physics 320,321 
—as small amount of time 89 
— in method of fuxions 297-299, 311, 
360, 439 
Mongol empire 43 
moon _1,3-6 
distance of — from earth 6 
Moore, Jonas 77,437 
Motte, Andrew 215,307,360, 440 
Mourey, C.V. 190, 437 
Muawiyya ibn Abu Sufyan 34 
Miller, Johannes see Regiomontanus 
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Napier, John 78-106, 437, 438 
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Nasir al-Din, Nasir Eddin, Nassiruddin-el 
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Navarre, Collége de 203 
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Necker, Jacques 373 

Newton, Isaac 15, 70,71, 115, 122-125, 
127-130, 132-136, 138-142, 172, 173, 
181, 182, 195, 209, 214-218, 220, 221, 
226-229, 234, 257, 275, 281, 289-295, 
297-307, 309-317, 330, 331, 333-335, 
339, 356, 358-360, 368, 369, 394, 438, 
439 

Newton, John 

Nicea 1,2,7 

Nicomachos of Gerasa 36 

Niewentiit, Bernardo 357,359, 436 

Nilakantha Somayaji 201,212,445 

Nizari Ismailis 43 

Nordgaard, MartinA. 186 

Norway 186 

Nuremberg 63 


77, 441 


O 


observatory 44, 80, 191 

Oldenburg, Henry 130, 134, 206, 286, 290, 
305, 306, 334, 347, 354, 421, 428, 429, 
434, 440, 445 

omn. 319 

On the sizes and distances of the sun and moon 
3-6, 413, 430 

Oresme, Nicole 203, 232, 420, 437, 441, 442 

Otho, Lucivs Valentinvs 63, 64 

Oughtred, William 77, 86, 428, 442 
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444, 447 

Pappus _ see Pappos of Alexandria 

Paris 57, 65, 188, 203, 234, 334, 340, 361, 
364, 368, 373, 388, 393, 398 

—meridian 57 

Parmentier, Marc 341, 358, 434, 442 

Parr, Richard 101,418, 443 

Pascal, Blaise 123,234 

Pascal’s triangle 122, 124, 132 

Pataliputra 25 

Patna 25 

Pavia 150 

Pell, John 116 

Peripatetic Lyceum 2 

perpendiculum (perpendicular) of right triangle 
66-74, 171 

Persepolis 36 

Persia 8, 34, 36,38 

Petrus Alphonsi 56 

Peurbach, Georg 57 

Philip Il of Macedonia 8 

Philosophie naturalis principia mathematica 
see Principia 

pl see mz (symbol) 

Piatti Foundation 151, 152 

Pitiscus, Bartholomaéus 65, 76, 443 

piu di meno 164, 165, 169, 172, see also 
imaginary number 

plague 127,289 

planetary motion 

Plato of Tivoli 

Plato Tiburtinus 

Poitiers 35 
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Pomerania 117 

Portugal 380 

power series 225,291,399, 409 

Prague 101,117,386 

pricked letters 312,315 

principal table 96, 98, 99, see also logarithm 


13, 239-243, 442, 


24, 101 
56, 149 
56, see Plato of Tivoli 
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Principia 215, 218, 220, 307, 317, 360, 369, 
440 
Principios mathematicos 
394, 421 
Probst, Siegmund 318, 436 
progression — see arithmetic progression, 
geometric progression, harmonic 
progression 
prosinus 74, see also trigonometric lengths 
—complementi 74 
prosthapheresis 80 
Ptolemaic unit 26 
diameter divided into 120? 10 
perimeter divided into 360° 10 
Ptolemaios, Klaudios see Ptolemy, Claudius 
Ptolemais 2,8 
Ptolemy lof Egypt 8 
Ptolemy, Claudius 1, 7—24, 32, 36, 38, 
42-44, 56, 57, 443 
Ptolemy’s theorem 15, 17, 19, 42, 43 
Pulzer, Aron 387 
Pythagorean 
— study of music 196 
—theorem 20,46 


380, 381, 385, 386 
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quadratic equation 61, 148, 149, 168, 170, 
178, 240, 241 
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221, 225-227, 230, 233, 248, 252, 253, 
256, 260, 275, 283, 286, 288, 289, 293, 
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336, 337, 339-342, 347, 348, 350, 357, 
358, 415, 425, 429, 432-434, 436, 
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squaring 
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340, 346, 348, 356, 413 

quotient rule 224, 290, 336, 339, 354 
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Rajagopal, Cadambathur Tiruvenkatacharlu 
212, 349, 443 


Rangachari, M.S. 212, 349, 443 
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429, 443 
Recorde, Robert 86,444 
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Regiomontanus 

Regiomontanus 57—64, 67, 75,77, 84, 444 

Renaissance 56,57, 84, 156 

residue of a binomial 164, 167, 169, see also 
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Rheinhold, Erasmus 99 

Rheticus, Georg Joachim 63, 64, 66, 84, 444 
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Rhodes 2,7 

Ricci, Michelangelo 305, 444 

Richard of Wallingford 57 

Riemann, Bernhard 234 
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Roberval, Gilles Personne de 234, 248, 425, 
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Rome, Adolphe 13, 444 
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Rouen 203 
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Rudolf II 101 
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128, 130, 214, 220, 306 
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Sacrum Romanum Imperium 101 
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Saint Gallen 55 

Saint Luke, College of 380 

Saint-Vincent, Grégoire de 
318, 445 

Samarkand 35,55 

Samarra 35,37 

Samos 1-3 

Sanskrit 25, 28-30, 212, 413 

Sarasa, Alfonso Antonio de _ see de Sarasa, 
Alfonso Antonio 


117, 119, 120 
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Sarma, K. Venkateswara 27, 30, 414, 432 
Sasanians 36 
Sastri, K. Sambasiva 201,212,445 
Sauvetat, Raimundo de 56 
Savasorda see Bar Hiyya, Abraham 
Savile, Henry 115 
Scherrer, Friedrich Robert 424 
Schénberger, Otto 446 
Schooten, Frans van 
Schrader, Julius 393 
Scot, Michael 56 
Scotland 78, 80 
secant 39,54, 67, 75, 77, 222, 223, 349, see 
also trigonometric functions 
— ofthe rest 67, see also trigonometric 
lengths 
series expansion for 
table of—s 63 
second (arc) 10,11, 13,38, 63, 426, 427, see 
also degree (arc), minute (arc) 


214,221 


second shadow 53, 54, see also trigonometric 


lengths 
Sefardi, Moshé 56 
Sefrin-Weis, Heike 
Segovia 56 
Seidel, Phillip Ludwig von 
separation of variables 380 


318, 436 


409, 445 


series 
binomial 115, 122, 125, 126, 130, 219, 
220, 227 
infinit 124-130, 132, 139-143, 183, 195, 


198, 201, 206, 209, 211-213, 217, 229, 
232, 275, 289, 291, 293-295, 302, 307, 
317, 334, 369, 372, 373, 378-380, 388, 
399, 410, 432, 441 


Seville 55 
sexagesimal 11, 13,38, 42, 84 
shadow 1, 7, 37-39, 43, 53, 54, see also 


diameter, firs shadow, level shadow, 
second shadow, straight shadow, 
trigonometric lengths, turned shadow 

Shakespeare 81 

Sharma, P.D. 201, 445 

Shaw, Gideon 223, 225,226 

Shrites 34 

Shukla, Kripa Shankar 27, 30, 413, 414 

Shute, Charles 196, 444, 445 

Sicily 56 

Sind 36 


see van Schooten, Frans 


Sindhind 36 
sine 5, 11,57, 58, 62, 67, 70, 75, 77, 88-100, 
177, 179, 183, 223, 378-380, see also 
Jyba, jyva, sinus, trigonometric functions 
computation with—s 84, 87 
inf nite series of —s 369, 372, 373, 378, 
379 
law of —s _ see law of sines 
—ofsum or difference 32, 39-43, 70 
— ofthe complement 98, 100 
—ofthe rest 67, see also trigonometric 
lengths 
product of —s 80 
relation of chordto 17,19, 20 
relation of jyva or sinus to 31, 33,99, 213 
series expansion for 181, 183,212,214, 
226 
— as product of factors 208-211 
table of —s 37, 39, 59, 62, 64, 67, 80, 88, 
96, 99 
versed see versed sine 
—wave 369 
whole see whole sine 
Singh, Avadesh Narayan 30 
sinus 59-61, 75, 85, 182, see also sine, 
trigonometric lengths 
—as translation of jyb 56,57 
—complementi 77,182 
sine as synonym for 88 
— totus 59,62, 84, 85, 88, 102, 177, see 
also whole sine 
slope 261, 262,279, 285, 310, 337, 354 
Sluse, René Francois Walther de 234, 305, 
306, 318, 330, 331, 333, 336, 445, 446 
Smith, David Eugene 29, 57, 65, 83, 137, 
149, 152, 154, 155, 158, 173, 186, 192, 
235, 359, 388, 422, 424, 433, 435, 438, 
440, 446 
Smith, Robert 
Smyma 9 
solar year 7 
solstice 1,7 
solution of triangles 
67, 76, 418 
sorcerer 63 
Sosigenes 7 
Soudry 340 
Spain 56, see also Cordoba caliphate, Iberian 
peninsula 


182, 421 


10, 44, 49, 51, 58, 61, 
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Speiser, David 372 
square root 
extraction of 
194 
— of negative number 
190, 449 
squaring see also quadrature 
—ofcircle 118 
—ofcurves 312 
— of hyperbola 
Sri-Kundapura 201 
star(s) 1,37,62 
distance of fied 3 
Stedall, Jacqueline Anne 130, 131, 243, 412, 
413, 416, 418-420, 424-426, 435, 437, 
446, 447, 449 
Stevin, Simon 84 
Stifel, Michael 83, 100, 118, 123, 124, 446 
Stokes, George Gabriel 409, 446 
Stone,Edmund 414, 431 
straight shadow 39, 53, 57, see also 
trigonometric lengths 
Strato of Lampascus 2 
Struik, Dirk Jan 148, 152, 203, 209, 233, 
237, 283, 350, 361, 364, 371, 373, 424, 
425, 429, 431, 433, 435, 436, 438, 440, 
446, 447 
subnormal 289,290 
subscript 218, 334, 336, 341 
substitution 270,297, 301, 306, 354, 364, 
406 
subtangent 223, 246, 247, 259, 261-263, 
306, 328 
Suiseth, Richard 230-232, 295, 446 
sum of reciprocals of squares 206-209, see 
also p-series 
sun 1, 3-7, 26,37 
distance of — from earth 6 


13, 81, 106, 108, 109, 193, 


148, 163, 170, 189, 


118, 128, 418 


Sunnis 34 

supernova 62 

Surya Siddhanta 26-29, 31-33, 36, 418 
Syncriseos 239-241, 243 

syncrisis 239, 240, 242, 243 

Syracuse 3, 198, 199,413 

T 


tabula artificialis 82, see also artificia 


number 


tabula fecunda 63,67, see also tangent 
(trigonometry), table of 
tangent to curve 199, 200, 226, 233, 234, 
243, 246, 247, 258-262, 264, 268, 269, 
271, 272, 275-277, 279, 281, 283, 285, 
286, 288, 294, 298, 302-306, 309, 310, 
316-318, 322, 323, 329, 331, 333-337, 
339, 341, 343, 349, 351, 353-355, 357, 
414, 425, 433-436, 446, see also slope, 
subtangent 
tangent (trigonometry) 6 ,37 ,38 ,54 ,67 ,74, 
75, 77, 100, 222—225, 287, 288, 349, see 
also trigonometric functions 
law of —s _ see law of tangents 
—ofthe rest 67, see also trigonometric 
lengths 
series expansion for 214, 221, 223 
table of —s 37-39, 63, 67 
tanto used for unknown in equation 163, 
168, 179 
Tantrasangraha 
Tantrasangrahavyakhya 
Tarazona 56 
Tartaglia, Nicolo 150-153, 160, 426 
Taylor, Brook 220, 221, 223, 225, 227, 369, 


201, 212 
212, 349 


414, 447 

Taylor series 220, 221, 226-229, 315, 400, 
439, 447 

Taylor’s theorem 258, 397 

Temujin 43 


Theon of Alexandria 7, 13, 444 
Theon of Smyma 9 
thiende, De 84 
“thing” used for unknown in equation 153, 
158, 163, see also cossist 
Thirty Years’ War 101,117 
Thomas, Ivor 2,7, 10, 13,443, 447 
Thoren, Victor E. 80, 447 
thousand and one nights, The 36 
Toledan tables 55-57 
Toledo 55,56 
— School of Translators 56 
Toomer, Gerald James 7, 8, 10, 443, 447 
Torricelli, Evangelista 234, 305 
Toulouse 56, 234 
Tours 35 
Tractatus fluxionum 294, 295, 302, 312, 441 
translations 37 
— from Arabic into Latin 56 
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— from Greek into Pahlavi 36 
— from Persian into Arabic 44 
— into Greek and Coptic 36 
transmutation 346-348 
transsinuosa 74, see also trigonometric 
lengths 
Treatise on the transversal figur 
shakl al-qatta 
triangle inequality 344 
Trigonometria britannica 
trigonometric 
— formula for area of triangle 61 
—functions 10,54, 58,59, 177, 222, see 
also cosecant, cosine, cotangent, secant, 
sine, tangent (trigonometry) 
abbreviations for 76,77, 177,371,377 
— identity 17, 19-21, 42, 54, 65, 67, 74, 
80, 98, 171, 176, 393 
—lengths 32,37, 39, 43, 53, 54, 57, 66, 
67, 80, 177, 222, 223, see also basis, 
chord, diameter of the firs or second 
shadow, fecundus, feecundus residue, 
firs shadow, half-chord, hypotenusa, 
hypotenvsam residue, jyba, jyba of 
the complement, jyva, level shadow, 
perpendiculum, prosinus, prosinus 
complementi, secant of the rest, second 
shadow, shadow, sine of the rest, straight 
shadow, sinus, sinus complementi, sinus 
totus, tangent of the rest, transsinuosa, 
turned shadow, umbra recta, umbra 
versa, utkramajya, versed half-chord 


see Kitab 


116, 418 


—lines _ see trigonometric lengths 
—ratios 177 
—tables 38,55, 63-66, see also chord, 


cosine, half-chord, jyba of the com- 
plement, jyva, secant, sine, tangent 
(trigonometry) 
trigonometry 1, 4—10, 12, 18, 26, 33, 34, 
37-39, 43, 44, 48, 53-60, 62—68, 74-76, 
177, 340, 402, 432, 443, 447, 450 
Hipparchos founder of 7 
plane 1, 8,58, 63, 74 
spherical 1, 8, 53,58, 76 
Trinity College 288 
Trkkantiyur 201 
Tschirnhaus, Ehrenfried Walter von 350 
Turkey 7 
Turkmenistan 37 


Turnbull, Herbert Westren 130, 134, 214, 
223, 261, 263, 277, 303, 305, 331, 349, 
421, 429, 434, 439, 440, 443, 445, 447 

turned shadow 39, 53,57, see also 
trigonometric lengths 

Tus 35,43 

twelfth-century Renaissance 56 

Tycho Brahe see Brahe, Tycho 


U 


Ujjain 25, 33 
ult. 319 
ultimate ratio 310, 311, 317, 365, 368, 394, 
see also firs ratio, last ratio 
Ulug Beg, Great Khan 55 
Umayyad dynasty 34-36, 55 
umbra _ see also trigonometric lengths 
—recta 57 
—versa 57,62 
Unfinde 57 
uniform 
—convergence 407-411 
— motion or quality 230—233, 420, 441 
uniformly continuous 405 
universe 2,3 
Uraniborg 80, 447 
Usher, James 101,418, 443 
Uthman ibn Affan 34 


utkramajya 32, 33, see also trigonometric 
lengths 

Uzbekistan 37, 43 

Vv 


van Schooten, Frans 74, 201, 234, 304, 431, 
448 
vector 186, 187, 191,407 
Venice 150 
verse, astronomical and mathematical work in 
25-32, 213 
versed 
—half-chord 32, see also trigonometric 
lengths 
—sine 178,356 
Vestby 186 
Vienna 
Academy of 57 
University of 57 
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Vieta see Viéte, Francois 

Viéte, Francois 65-75, 77, 84, 85, 168, 
170-172, 201, 202, 239, 241, 242, 250, 
251, 254, 278, 447, 448, 450 

Vlacq, Adriaan 116,418 

von Lauchen, Georg Joachim 
Georg Joachim 


see Rheticus, 


Ww 


Walker, Evelyn 248, 448 

Wallis, John 127, 130-133, 172, 185, 234, 
248, 290, 295, 315, 421, 439, 449 

Waring, Edward 391, 449 

Warren, John 190,449 

Weierstrass, Karl Theodor Wilhelm 234, 
409-411, 449 

Werner, Johannes 80 

Wessel, Caspar 185-189, 191, 445, 449 

Wessel’s multiplication law 186-188 

White Mountain, Battle of the 101 

Whiteside, Derek Thomas 70, 115, 124, 126, 
127, 142, 181, 209, 214, 216, 218, 227, 
228, 289, 291, 292, 294-298, 303, 312, 


315, 316, 330, 333, 335, 360, 438-441, 
450 

whole sine 
totus 

Whood, Isaac 276 

Witmer, T. Richard 67, 69, 148, 152, 159, 
170, 239, 419, 448, 450 

Wittenberg 63 

Wittich, Paul 80 

Woepcke, Franz 38,450 

Wolf, Rudolph 356, 361, 450 

Wood, Robert 442 

Wright, Edward 81, 86, 100, 438 

Wright, Orville 403 


59, 84, 90, 103, see also sinus 


Y 
Yuktibhasa 212, 349, 432 

Z 

Zeller, Mary Claudia 48, 75, 450 


Zij tulaytulah see Toledan tables 
zodiac 3,4,28 


